The international journal of science /23 April 2020 


natur 


1 


a 
) 
Ee _ 
} | How extreme rainfall might 
aS have helped trigger the 2018 
: A Kilauea eruption 
ca aes ¥y 
<; 4 
. ~ \ 
x 
be 
> 
Climate scenarios Cartrouble WOE . Rate of change Fe i a 
Fresh approaches to The cost errors Predicting how fast ; ; 
modelling the future stalling the switch biodiversity willalter~ § 
of planet Earth togreen vehicles ina warming world B 


The international journal of science /23 April 2020 


nature 


Freezing World 
Health Organization 
funding is dangerous 


Researchers everywhere must challenge 
Donald Trump’s undermining of the global 
health agency. 


midthe biggest global health crisis inacentury, 
it is dangerous to hurt the one intergovern- 
mental agency most able to guide the world 
out of it. But that is precisely what happened 
this week. 

On the orders of President Donald Trump, the United 
States, one of the key founding members of — and the 
largest donor to — the World Health Organization (WHO), 
announced its intention to freeze its funding for the agency, 
pending a review of WHO actions during the coronavirus 
pandemic. The review could last for up to three months. 

Trump’s administration has been increasingly critical 
of the agency, which for months has guided the world in 
howto tackle the deadly coronavirus. That work — and the 
WHO’s other life-saving interventions around the world 
— will be at risk if the agency loses its US funding, which 
amounted to nearly US$900 million for 2018-19. 

It isn’t yet clear whether the White House can with- 
hold this funding — especially the portion that has been 
approved by the US Congress — and if so, how much it 
can keep back. But even talk of doing so in the middle ofa 
global health and economic crisis cannot be condemned 
strongly enough. 

De-funding the WHO is especially dangerous for those 
low-income countries in which the agency’s work is crucial 
to maintaining standards of public-health infrastructure, 
and also to tackling killer diseases. The WHO's epidemi- 
ologists, clinicians and logistics personnel are right now 
overseeing more than 35 emergency operations, includ- 
ing a measles outbreak in the Democratic Republic of the 
Congo and acholera outbreak in Yemen. 

On top of its emergency operations, the WHO handles 
ongoing efforts to treat tuberculosis and diabetes; erad- 
icate polio; and study tropical diseases. This is all on an 
annual budget of roughly $2.4 billion. Of this, the WHO’s 
emergency-response budget is approximately $280 mil- 
lion. By contrast, the agency that tackles public-health 
emergencies in the United States — the Centers for Dis- 
ease Control and Prevention — has a total budget of around 
$12.7 billion this year. 


Finding the balance 

A pandemic is always a big test for the WHO. In previous 
health emergencies, the agency has been criticized for act- 
ing too slowly, or — inthe case of the 2009 HIN1 influenza 
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The agency 
has offered 
leadership 
and acted 
according to 
the evidence 
ithas 
received.” 


pandemic — overstating the risks. But leading public-health 
researchers and practitioners agree that, so far in the cur- 
rent crisis, the agency has offered leadership and acted 
according to the evidence it has received. 

The WHO was notified of a cluster of pneumonia cases 
by China on 31 December, and it began an emergency-re- 
sponse process the following day. Its many actions since 
then include posting and updating guidance on howto 
diagnose COVID-19, vetting diagnostic tests and distribut- 
ing them around the world. The agency’s science division 
convened world experts to survey potential therapeutics. 
From this, it developed an adaptable clinical-trial protocol, 
known as SOLIDARITY, that has been launched globally. 

More recently, the WHO has set up a supply-chain man- 
agement system totry to ensure that low-income countries 
are not left without tests, medical equipment or protective 
gear for health workers — given the fierce competition for 
these limited resources. 

The WHO declared a public health emergency of inter- 
national concern, or PHEIC, on 30 January. That announce- 
mentis atrigger for the agency’s member governments to 
follow its recommendations. These include establishing a 
comprehensive programme of testing, quarantining peo- 
ple suspected to be infected, and tracing their contacts. 

Some countries acted quickly, including Germany, Singa- 
pore and South Korea. But the United States is among those 
that has not followed these particular recommendations. 
Even now, it does not have a national infrastructure for 
testing for the virus, nor for tracing the contacts of those 
infected with it. 

In early March, WHO director-general Tedros Adhanom 
Ghebreyesus pleaded with the world when he said: “You 
can’t fight a virus if you don’t know where itis. That means 
robust surveillance to find, isolate, test and treat every 
case, to break the chains of transmission.” 

But the Trump administration chose not to follow the 
WHO’s advice. Instead, influential lawmakers have been 
calling for an investigation into the WHO’s actions, claim- 
ing that the agency was too slowto sound the alarm and too 
deferential to the Chinese government. At the same time, 
they are implicating the WHO in wider questions being 
directed at China’s government. These include that China 
could have acted more quickly to lock down in the days 
after the first outbreak, and that public officials withheld 
important information. Such questions must be asked of 
China, but they are not for the WHO — which acts at the 
behest of governments — to answer. And they are not 
reasons to de-fund the agency. 

Itis, of course, crucial that lessons are learnt from all 
stages of this pandemic. Once itis over, there will be many 
national and international investigations and inquiries — 
including the WHO’s own — and these will uncover what 
went right, what went wrong and what could have been 
done better. It is always tough to operate in a pandemic, 
and tougher still when essential cooperation between gov- 
ernments is at a low ebb. Such inquiries will be an oppor- 
tunity to improve and to grow. They are not a reason to 
undermine or attack. 

This pandemic needs the world to follow a coordinated 
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plan, covering decisions including how and when lock- 
downs areto be relaxed. It is extraordinary that more than 
three months into the outbreak, sucha planis nowhere to 
beseen. On19 April, the health ministers of the G20 group 
of nations met virtually. They must put sucha plan in place. 
The best hope of achieving that is for all nations to work 
with the WHO and other international agencies. 

Itis right that researchers, funders and governments 
have been protesting against Trump’s decision, and they 
must continue to do so in the strongest terms. Those in 
the United States must also lobby their lawmakers at every 
level. The president and his administration must not with- 
hold funding from the WHO. Doing so will place more lives 
at risk and ensure that the world takes longer to emerge 
from this crisis. 

Nearly 70 years ago, the United States was instrumental 
in helping to establish the WHO. Nations realized that they 
needed such an agency in part because they couldn't tackle 
pandemics by acting alone. Itis asad indictment of the state 
of our world that the agency is now having to fight for its 
future while doing the job it was created to do. 

We need to support the WHO soitis at its strongest, not 
undermine it at such a crucial hour. 


Climate action and 
poverty alleviation 
go hand-in-hand 


The world urgently needs a post-pandemic 
consensus on tackling climate change. 


or the first time since its inception 50 years ago, 

this year’s Earth Day, on 22 April, will coincide 

with the fleeting prospect of a lower carbon 

footprint, as the fastest economic slowdown the 

world has ever seen has grounded transport and 
closed workplaces. 

The ‘new normal’ — as some are calling it — also comes 
at huge social and economic cost. As Nature went to press, 
the SARS-CoV-2 coronavirus had taken more than170,000 
lives, anumber that will continue to rise. And the pan- 
demic has also precipitated an unprecedented economic 
shock. Worldwide, tens of millions have been made unem- 
ployed. For now, governments are rightly focusing on 
spending trillions of dollars to keep health-care systems 
functioning, to pay for rising welfare costs and to support 
companies to prevent more workers losing their jobs. 

But, at the same time, many carbon-intensive industries 
in coal, oil and gas are queuing up for bailouts. Govern- 
ments need to resist. Before the pandemic, momentum 
was building towards decarbonization — for example, 
through commitments from governments on net-zero 
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emissions and through green new deals. This work must 
not be undone. 

But a greener post-pandemic future cannot come atthe 
expense of livelihoods — particularly those of the lowest 
paid and those in developing countries. The United Nations 
is forecasting that a drop in demand from high-income 
nations means that low- and middle-income countries will 
lose hundreds of billions of dollars in export earnings in 
2020. Without urgent research and action, many of these 
countries are looking at vast numbers of their citizens 
staying out of work. 


Polluter pays 


Fortunately, there’s one action that could contribute to 
easing some of the coming hardships and, at the same 
time, ensure that development continues on a sustain- 
able path. After the 1992 Rio Earth Summit, developed 
nations pledged to help developing nations with research 
and development and with green financing. This wasn’t aid 
so muchas an application of the ‘polluter pays’ principle. 
Many of the richer countries had recognized that their 
actions had caused climate change. And they agreed that 
they had aresponsibility to fund less-developed countries, 
both to help those nations become more resilient to the 
effects of global warming, and so that those countries 
could continue to develop, albeit in greener ways. 

Adecade ago, developed countries pledged to channel 
US$100 billion annually to developing nations in climate 
finance by 2020. But — as we reported in September (Nature 
573, 328-331; 2019) — only $71 billion reached its destina- 
tionin 2017, and this was mostly in loans, not grants. Inthe 
context of today’s bailouts, these are not onerous sums. 
Worldwide, some $2.4 trillion a year will be needed for 
the next 15 years just to transform energy systems to keep 
global temperatures from exceeding 1.5 °C above pre-in- 
dustrial levels. Asthe economic crisis deepens, more loans 
are being offered by multilateral lending agencies. But 
loans are no substitute for the failure to keep past promises. 

It’s unfortunate that the next Conference of the 
Parties to the UN’s climate convention — due to take place 
in Glasgow, UK, in November — has had to be postponed, 
because this is where developed nations would have been 
reminded of their obligations. However, in the spirit of cur- 
rent work-pattern adjustments, this meeting — or at least 
preparations for it — could still take place virtually. The 
coming economic stimulus packages must include finance 
for greener development. And long-promised funding for 
developing countries must also be made good. 

The pandemic has taught the world a sharp lesson in 
what happens when there is a swift economic shock. A simi- 
lar shock could lie ahead — as economists have long warned 
— if action is not taken to curb climate change. The Inter- 
national Monetary Fund is projecting that growthin most 
countries is likely to bounce back in 2021 if lockdowns do 
not persist. But the world might not be so resilient should 
suchashock result from extreme climate events, or rising 
sea levels. 

That is why greener forms of growth must remain a 
priority. But development must be equitable, too. 
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A personal take on science and society 


World view 


By Emma Marris 


To mark the 50th Earth Day, 
take collective action 


Reducing your own carbon footprint is r | r | 
not as powerful as calling governments 
and companies to account. It pays to 


be strategic 


verybody can and should do something to protect when 

our shared home, Earth. But it pays to be strat- deciding 

egic when deciding what action to take. In recent a 

years, | have joined the board of a local climate what action 

non-profit group, marched, written local and | CO take.” 
national editorials, and even been hauled to jail for occu- 
pying the Oregon state capitol to protest against a pipeline 
project. And I’ve never felt more sure of myself. 

For at least the past decade, environmental groups have 
been suggesting we reduce individual emissions by altering 
our behaviour: choosing public transport, eating less meat, 
buying more efficient light bulbs. In fact, it was the oil giant 
BP that popularized the ‘personal carbon footprint’ in 2005. 
This focus has kept individuals working on their own impact 
while letting governments, and corporations that profit 
from climate change, off the hook. 

This year, 22 April marked the SOth annual Earth Day, a 
day of protest in favour of environmental protection. It is 
past time to shift our focus to policies that can get at the root 
cause of the problem: the extraction and burning of fossil 
fuels. The transition to zero-emissions energy needs to hap- 
pen quickly — but also equitably, without making energy 
unobtainable for poor and marginalized people. By acting 
together to demand this, we can all have an impact much 
larger than whatever reductions we can make as individuals. 

The average person in an industrialized country is 
responsible for around 10 tonnes of carbon emissions per 
year, so that’s arough limit to what anyone can accomplish 
by addressing only their own footprint. If someone were 
to join a campaign to close down a coal-powered plant, 
and it prevailed, that could help to eliminate 10 million 
tonnes of carbona year — much more than any individual 
could prevent over a whole lifetime. And helping to pass 
laws that shut downall coal plantsina single country could 
multiply that impact. 

Scientists are well positioned to contribute to such 
campaigns. They have skills that can be valuable in this 
fight, even if their area of expertise has nothing to do with 
atmospheric chemistry, soil science, electrical engineer- 
ing, conservation planning or any of the hundreds of other 
specialisms that directly intersect with the problem. 

Scientific habits of mind can help anyone to identify | Emma Marris 
where they have the most leverage. Do youjoin10,000 | writes about the 
people to push for a national policy change, 1,000 peo- | environment from 
ple to demand your employer divest from fossil fuels, or | Oregon. 

10 people to push for a new bike lane by your house? Any | e-mail: e.marris@ 
choice will be good, and success at any level willdo much | gmail.com 


more to stop climate change than alifetime of green living. 

Everyone who works for an employer, is part of acommu- 
nity, or isa member of afield has several possible spheres 
of influence. They can push their employer to divest from 
fossil fuels; support students who are demanding changes 
across a university system; lobby agovernment agency to 
include climate impacts in decision-making; or even goon 
strike iftheir employer continues to pursue profits that are 
tangled up with torching the climate. They can go to coun- 
cil meetings and recommend that their community sets 
emissions targets or invests in community-run renewable 
energy or public transport. They can push for their pro- 
fessional organizations to make commitments and public 
statements. They can take to the streets and protest; they 
can run for office; they can volunteer to take the muffins 
to the next meeting of the local climate-activism group. 

Climate change is not the only environmental problem 
facing Earth — there are other major threats to the diversity 
of life. Working to institute policies that protect complex 
ecosystems from clearance and development can also have 
high rewards. There might be species on the edge of extinc- 
tionin your area that you can help by attending community 
meetings, writing comments during government planning 
processes or lobbying your local representatives. 

Equally, fighting for the rights of people of colour, Indig- 
enous people, people from sexual and gender minorities, 
poor people and members of other marginalized groupsis 
a powerful way of engaging in the fight for climate justice. 
When everyone really has equal power, policies that allow 
historically favoured groups to profit from the ongoing and 
deepening misery of others will be changed. 

There are as many ways to engage with collective action 
as there are people. But the key word is collective. By all 
means, green up your personal life. It can help, and often 
comes with other benefits, as well. But we should prioritize 
changing the systems we all live in that make it difficult and 
expensive to make green choices. 

You might worry that advocating on behalf of the climate 
puts your reputation as a dispassionate, rational scientist 
at risk. Isympathize. As a freelance writer, I worried that 
getting involved in climate activism would mean I could 
no longer claim to be an unbiased journalist. In the end, 
I decided that sharing my fears about the future, my love 
for Earth and its life, and my opinions about the changes 
we need to make to protect our home did not conflict with 
my core commitment to seek out and report the truth. 

Going public with your opinions need not conflict with 
the scientific search for truth. Nor do you have to dedicate 
your life to collective action for the environment. Even 
a couple of hours every other week can achieve great 
things. Individually, we are puny; together, we can change 
the world. 
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The world this week 


Newsin brief 


RAREBLACK-HOLE 
COLLISION OPENS 
NEW WINDOW ON 
UNIVERSE 


Gravitational-wave astronomers 
have for the first time detected 
a collision between two black 
holes of substantially different 
masses — opening up anew 
vista on astrophysics and on the 
physics of gravity. The event 
offers the first unmistakable 
evidence from these faint space- 
time ripples that one black hole 
was spinning before the merger, 
giving astronomers rare insight 
into this key property. 

“It’s an exceptional event,” 
said Maya Fishbach, an 
astrophysicist at the University 
of Chicago in Illinois, who 
unveiled the signal at a virtual 
meeting of the American 
Physical Society on 18 April. The 
US-based Laser Interferometer 
Gravitational-Wave Observatory 
(LIGO) and the Virgo 
observatory near Pisa, Italy, 
detected the collision last year. 

The observation network 
has seen mergers between 
black holes with roughly equal 
masses. Physicists had eagerly 
awaited events with black holes 
of uneven mass because they 
provide more precise ways of 
testing the general theory of 
relativity. In the latest merger, 
one black hole was around 
8 solar masses, and the other 
about 31. This imbalance made 
the larger black hole distort the 
space around it, so the other’s 
trajectory deviated froma 
perfect spiral. This could be seen 
in the resulting wave signal. 


CORONAVIRUS 
DISRUPTS RESEARCH 
FUNDING 


Although science is crucial to 
the fight against COVID-19, 
researchers unable to work 
are becoming increasingly 
concerned about how the 
coronavirus pandemic will 
affect their funding. 

Disruptions suchas the 
closure of labs and universities 
mean that researchers face 
challenges in completing 
projects and in paying lab 
members when grants run out. 

“If this situation lasts for 
more than two to three months 
it will be impossible to finish 
the projects on time,” says Juan 
Astorga-Wells, a biochemist 
at the Karolinska Institute in 
Stockholm who is involved in 
two projects supported by the 
European Union’s Horizon 2020 
research programme. 

Many of the world’s major 
research funders have put 
in place policies to support 
grant-holders. Horizon 2020's 
guidance says that researchers 
can ask to extend projects by up 
to six months, and reallocate 
funds to meet the costs of 
working remotely, or to help 
pay the salaries of researchers 
who are unable to continue with 
experiments. Projects can also 
be reoriented towards research 
on COVID-19 or coronaviruses. 

US government funders 
including the National 
Institutes of Health also offer 
grant extensions, and will 
accommodate late applications 
for funding. 

Funders have also put out 
rapid calls for proposals 
for coronavirus research. 
UK Research and Innovation 
has allocated £20 million 
(US$25 million) to the topic. 


Revealed: howa spacecraft 
will bring Mars rocks to Earth 


NASA and the European Space Agency (ESA) have 
unveiled details of their upcoming mission to retrieve 
rock samples from Mars. 

The first stage will start inJuly, when NASA launches 
its Perseverance rover (pictured) to roll around 
onthe Martian surface and collect dust and rock. 
Perseverance will land next February in the red planet’s 
Jezero crater. As it drives around for many kilometres, 
the rover will drill or scoop up material to fill around 
30 geological sampling tubes. 

Until now, it hadn't been clear how those tubes might 
get back to Earth. But after four years of designing 
and plotting, NASA and ESA have finalized a plan that 
involves sending a pair of spacecraft to Mars in 2026. 

The first spacecraft would land in Jezero 
crater. There, asmall rover would make its way to 
Perseverance, pick up the filled sampling tubes and 
transfer them to a “Mars ascent vehicle” — essentially 
asmall rocket with a container to carry the samples. 
The Mars ascent vehicle would blast off and place the 
container into Martian orbit. The second spacecraft 
would then manoeuvre itself next to the sample 
container, pick it up and fly it back to Earth, probably 
landing ina military training ground in Utah. 

Ifthe plan works, scientists will finally get their 
hands on the Martian rocks in 2031. 
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The world this week 


News in focus 


Research from China is crucial to understanding the COVID-19 pandemic. 


CHINA IS TIGHTENING ITS 
GRIP ON CORONAVIRUS 
RESEARCH FINDINGS 


Some scientists welcome government vetting because it could stop publication of 
poor-quality COVID-19 papers — others fear it isan attempt to control information. 


By Andrew Silver & David Cyranoski 


hina’s government has started 

asserting tight control over COVID-19 

research findings. Over the past two 

months, it seems to have quietly intro- 

duced policies that require scientists 
to get approval to publish — or publicize — 
their results, according to documents seen 
by Nature and some researchers. 

This fits with media reports that at least 
two Chinese universities have posted notices 
online stating that research on the virus’s 
origins needs approval from a university 
committee and the Ministry of Science and 
Technology (MOST) or Ministry of Education 
(MOE) before being submitted for publication. 


Scientists in China say the changes are 
probably a response to poor-quality studies 
onthe virus — and several welcome them. 

But some academics have suggested that 
the vetting process could delay publication 
ofimportant insights that could help to reinin 
the pandemic, and that the policies are part of 
China’s attempt to control information about 
the start of the outbreak. 

Last month, China’s foreign-ministry 
spokesperson, Zhao Lijian, made sensational 
claims that the virus might have come to the 
country from the United States, prompting 
concerns that the Chinese government’s 
statements were not always guided by sci- 
ence. Although the exact origin of the virus is 
unknown, researchers think it probably came 
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from bats and then spread to a carrier animal 
before infecting the first people somewhere 
in central China late last year. 

Awareness of the new rules is mixed among 
researchers in China. The ministries seem 
not to have posted notices about the poli- 
cies on their websites, and they have not yet 
responded to Nature’s attempts to confirm 
that they have released the documents. 

Government oversight of COVID-19 research 
seems to have started with a directive to uni- 
versities. A document that seems to be from 
the MOE, and is dated 10 March, orders insti- 
tutions to get approval from the ministry and 
the Joint Prevention and Control Mechanism, 
run by the powerful State Council, before pub- 
licly announcing results on the origin of the 
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SARS-CoV-2 virus, its transmission routes or 
treatments or vaccines. The document states 
that universities need to consider “the ques- 
tions society is concerned about” when pubili- 
cizing research on the virus. (Nature was sent 
the document, which is stamped by the MOE 
and includes the name of an agency official, by 
a researcher who did not want to comment.) 
The education ministry seems to have issued 
another order after a meeting of the Joint Pre- 
vention and Control Mechanism on 25 March, 
according to a second notice that also seems 
to come from the MOE and has been posted 
on Pincong, a Chinese-language forum. This 
notice, dated 7 April, states that studies on the 
virus’s source must be approved by a univer- 
sity academic committee and the education 
ministry’s science and technology department 
before being published in a journal or posted 
onapreprint server or blog. Academic commit- 
tees must evaluate all other COVID-19 papers 
for “academic value and timing”. The notice 
also warns that studies must not exaggerate 
the efficacy of vaccines or treatments. 
According to archived web pages, the 7 April 
notice was reproduced on the website of the 
School of Information Science and Technology 
at Fudan University in Shanghai, but was subse- 
quently removed. UK newspaper The Observer 
has reported that a similar notice was posted 
on, and then removed from, the website of the 
China University of Geosciences in Wuhan. 


Helpful policies 

Several researchers in China welcome the 
vetting process for COVID-19 studies. Alice 
Hughes, aconservation biologist at the Chinese 
Academy of Sciences (CAS) Xishuangbanna 
Tropical Botanical Garden, says the measure 
will stop the dissemination of potentially in- 
accurate and sensationalist research, suchasa 
controversial study published in the Journal of 
Medical Virology in January, which suggested 
that snakes were the virus’s host. 

Hughes says her institute’s director told her 
in late February that research on COVID-19 
required MOST approval. She has not seen 
official policy documents herself. In early 
March, she says, she had a paper approved by 
the CAS and then by MOST within 72 hours.“We 
are continuing to see China publishing papers 
onthe origins through this system,” she says. 

Zhang Zhigang, an evolutionary micro- 
biologist at Yunnan University in Kunming who 
published onthe outbreak’s origins before the 
vetting process camein, also thinks it’s a good 
way to control research quality and reliability. 

But news of the policies hasn't reached all 
scientists or institutions. Chen Jin-Ping, an 
animal-disease researcher at the Guangdong 
Institute of Applied Biological Resources in 
Guangzhou who is also studying the virus’s 
origins, says he hasn’t been told that he needs 
ministry approval for his research to be pub- 
lished. And Fei Ma, dean of research and 
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graduate studies at Xi’an Jiaotong-Liverpool 
University in Suzhou, China, says he hasn’t 
heard of the need for coronavirus-related 
research to be approved by MOST or other 
government agencies. 

Denis Simon, executive vice-chancellor at 
Duke Kunshan University, says his institute 
hasn't received any official notices, but that 
researchers are discussing the issue. 

Some researchers outside China fear the 
vetting process could hold up the release of 
important research. “Right now we desper- 
ately need all kinds of research relating to 
SARS-CoV-2, from basic studies to understand 
mechanisms of disease to vaccines and ther- 
apeutics,” says Ashley St. John, a virologist at 
the Duke-NUS Medical School in Singapore. 


“We can’t afford any delays right now.” 

Understanding the origin of SARS-CoV-2 
could also lead to warning systems for virus 
spillovers from animals to people, she says. 

Sarah Cobey, an infectious-disease 
researcher at the University of Chicago in 
Illinois, adds that it would be problematic 
if results from China were being filtered or 
suppressed for reasons other than quality. 
Observations of viral spread across countries 
inform the use of interventions such as social 
distancing, she says. 

“If the research presents a biased picture, 
much of the record can eventually be corrected 
through studies of SARS-CoV-2 elsewhere,” 
she says, “but the distortion and delay would 
probably come at the cost of human health.” 


COVID-19 COULD RUIN 
WEATHER FORECASTS 
AND CLIMATE RECORDS 


As environmental-monitoring projects go dark, data 
that stretch back for decades are about to get gappy. 


By Giuliana Viglione 


wice each year, Ed Dever’s group at 

Oregon State University in Corvallis 

heads out to sea off the Oregon and 

Washington coasts to refurbish and 

clean more than 100 delicate sensors 
that make up one segment ofa US$44-million- 
per-year scientific network called the Ocean 
Observatories Initiative. “If this had been a 
normal year, I would have been at sea right 
now,’ he says. 

Instead, Dever is one of many scientists side- 
lined by the coronavirus pandemic, watching 
from afar as precious field data disappear and 
instruments degrade. The scientific pause 
could imperil weather forecasts and threaten 
long-standing climate studies. In some cases, 
researchers are expecting gaps in data that 
have been collected regularly for decades. 
“The break in the scientific record is probably 
unprecedented,” says Frank Davis, an ecologist 
at the University of California, Santa Barbara. 

Davis is executive director of the Long Term 
Ecological Research (LTER) programme, a 


“The break in the scientific 
record is probably 
unprecedented.” 
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network of 30 sites stretching from the far 
north of Alaska all the way down to Antarctica. 
Consisting of both urban and rural locations, 
the LTER network allows scientists to study 
ecological processes over decades — from 
the impact of dwindling snowfalls on the 
mountains of Colorado to the effects of pol- 
lution in a Baltimore stream. At some sites, 
this might be the first interruption in more 
than 40 years, he says. “That’s painful for the 
scientists involved.” 


Weather forecasting takes a hit 


Other monitoring programmes are facing 
similar gaps. Scientists often ride along on 
the commercial container ships that criss- 
cross the world’s oceans, collecting data 
and deploying a variety of instruments that 
measure weather, as well as currents and 
other properties of the ocean. Most of those 
ships are still running, but travel restrictions 
mean that scientists are not allowed on board, 
says Justine Parks, a marine technician who 
manages one such programme at the Scripps 
Institution of Oceanography in La Jolla, 
California. 

Port strikes and political instability have 
halted specific cruises in the past, Parks says. 
But, to her knowledge, this is the first time that 
the entire programme has shut down for an 
extended period of time. 

Measurements made at sea areimportant for 
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Scientists are skipping trips meant to maintain sensors for the Ocean Observatories Initiative. 


forecasting weather over the oceans, as well as 
for keeping longer-term records of ocean health 
and climate change, says Emma Heslop, a pro- 
gramme specialist in ocean observations at the 
Intergovernmental Oceanographic Commis- 
sion in Paris. Her group is still trying to assess 
the extent of the damage that the pandemic 
is doing to the ocean-observing community 
as a whole, but researchers are already feeling 
some effects. Over the past two months, they've 
seen steadily declining numbers of shipboard 
observations — amounting, since the beginning 
of February, to a15% loss of stations reporting 
data. And although the community is working 
hard to figure out other ways to collect impor- 
tant data, the situationis likely to worsen as the 
pandemic stretches on. “The longer the restric- 
tions are in place,’ Parks says, “the longer it will 
take for our operations to recover.” 
Commercial flights provide invaluable 
weather data, too — measuring temperature, 
pressure and wind speeds. The meteorolog- 
ical data provided by the US aircraft fleet 
had decreased to half its normal levels as of 
31 March, according to the US National Oceanic 
and Atmospheric Administration (NOAA). 


Maintenance woes 


Satellites and weather balloons can fill in some 
gaps, but certain aircraft data are irreplace- 
able. “It’s certainly the case that with the virtual 
loss of worldwide aviation, there is a gap in 
some of the records,” says Grahame Madge, a 
spokesperson for the UK Met Office in Exeter. 

The Met Office estimates that the loss of air- 
craft observations will increase their forecast 


error by 1-2%, but notes that, in areas where 
flights are typically more abundant, scientists’ 
forecast accuracy might suffer even more. 
The Met Office maintains more than 250 UK 
weather stations that provide continuous 
or daily feeds of autonomously collected 
atmospheric and weather data. For now, 
those systems are functioning just fine, but if 
an instrument goes down, Madge says, it will 
be difficult to get staff out to fix the problem. 

Many of the world’s atmospheric-monitoring 
data are collected with little to no human 
intervention, and such projects should be 
able to keep running. The Advanced Global 
Atmospheric Gases Experiment, for exam- 
ple, measures ozone-depleting compounds, 
greenhouse gases and other trace components 
in the atmosphere at 13 remote sites around 
the globe. Many of their systems are auton- 
omous: the stations are each staffed by one 
or two people who perform routine mainte- 
nance to keep the instruments running. Ray 
Weiss, an atmospheric chemist at Scripps who 
leads the project, says that two instruments 
have broken down so far, but the loss of a sin- 
gle instrument or even a whole site for a few 
weeks is unlikely to jeopardize the network’s 
monitoring capabilities. Arlyn Andrews, who 
runs NOAA’s greenhouse-gas-monitoring pro- 
gramme, says that impacts on that network 
have been “relatively minor’, and less than 5% 
of the NOAA sites have lost data so far. 

Unless the situation gets a whole lot worse, 
Weiss anticipates that the programme will 
escape relatively unscathed. “We're limping 
through, is the bottom line.” 
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Charlie Swanton 


Cancer-evolution researcher Charlie 
Swanton at the Francis Crick Institute in 
London has led the conversion of some 
labs into a coronavirus testing facilities. 
Swanton, also a consultant oncologist at 
the University College London Hospitals 
(UCLH), spoke to Nature about the effort. 


How did this start? 

Scientists didn’t want to sit at home and 
read reports about increases in deaths. 
We reached out to the UCLH and they said 
there was an unmet need for staff and 
patient testing. So researchers set up a 
working group to convert laboratories here 
into a rapid real-time polymerase chain 
reaction (RT-PCR) screening facility. Five 
large laboratories here have now been 
repurposed. Everybody wanted to help. 


What does it take to retool a cancer lab 
into a diagnostic testing facility? 

You need the right people, laboratory 
infrastructure and reagents. We have here 
BSL-3 (biosafety-level-3) facilities and 
BSL-3 trained staff, 10-15 RT-PCR machines, 
environments to extract RNA from viral 
samples, and space. We repurposed a lot 
of the software tools that we use to track 
patients’ cancer and blood samples to help 
us track COVID-19 tests. 

We get swabs couriered from the UCLH 
every day: they're taken up in an isolated 
coronavirus-specific lift and barcoded; the 
virus is inactivated, the PCR test done and 
the results reported back via a messaging 
app to medical staff. We are currently 
doing hundreds of tests per day, and hope 
to get up to 500-1,000 tests per day. 


How have researchers adapted? 

This is a new way of working for many 

of our scientists and staff. Much of this 
diagnostic work is repetitive and quite 
boring, but the stakes are high. It’s been 
extraordinary to see the selflessness of 
scientists here to help in the bigger effort 
of getting medics back to the front line. 


Interview by Noah Baker 
This interview has been edited for length 
and clarity. 
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INDIA TRIES TO SLOW THE 
CORONAVIRUS WITHOUT 
WIDESPREAD TESTING 


Country of 1.3 billion people deploys vast surveillance 
network to trace and quarantine those infected. 


By Gayathri Vaidyanathan 


ike many nations, India does not have 

enough kits to test most of its popula- 

tion for the new coronavirus. Instead, it 

is relying on people power: thousands 

of health-care workers are fanning out 
across the country to trace and quarantine 
people who might have had contact with those 
with COVID-19. People are typically tested only 
if they develop symptoms. 

Countries such as South Korea isolated 
infected people on the basis of widespread 
testing, but some scientists say that India’s 
mass-surveillance approach could achieve a 
similar goal, and be relevant for other low- and 
middle-income countries facing kit shortages. 

Testing must still be part of India’s COVID-19 
strategy, and must be expanded rapidly, or 
infections will be missed, says Gagandeep 
Kang, executive director of the Translational 
Health Science and Technology Institute in 
Faridabad. Some people with the virus don't 
have symptoms, so their infections won't be 
detected otherwise. “There is going to be no 
solution to this without testing,” says Kang. As 
of 19 April, the country had conducted some 
400,000 tests — one of the lowest test rates per 
capita in the world. Still, epidemiologists say 
that India’s strategy to trace and quarantine 
contacts, along with the government ordering 
the country’s roughly 1.3 billion inhabitants 
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to stay at home for 21 days from late March 
— a deadline that has since been extended 
to 3 May — have probably helped to slow the 
spread of the virus in some places, and bought 
the country time to prepare its ailing health- 
care system. 

Most people are allowed out for essentials, 
such as food and medical care, but in most 
states those under quarantine are closely moni- 
tored by social workers, andin some areas they 
can’t leave their homes. The stakes are high: 
lockdowns are very hard on people who must 
work to feed themselves. 


Contact tracing 


India was well placed to send scores of 
public-health workers into villages, towns and 
cities to trace contacts and quarantine people. 
Its Integrated Disease Surveillance Programme 
(IDSP) already monitors people across the 
country for communicable diseases, and has 
been used to track HIN1influenza and measles. 

The network started watching for COVID- 
19 soon after the first case arrived in India in 
late January. When workers identify clusters of 
disease, acontainment zone is set up and every- 
one inside it must stay at home. Social workers 
go door to door, and anyone with symptoms is 
tested, along with members of their household 
and close contacts. Those who test positive are 
taken to isolation units or hospitals. 

Since monitoring began, the network has 
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helped to identify a majority of India’s con- 
firmed cases, which stood at 17,625 as of 
20 April, and put hundreds of thousands of 
people under surveillance. (Just 543 people 
are known to have died of COVID-19 in India.) 


Regional differences 


The network is strongest in rural areas and 
urban slums, where people rely on govern- 
ment services, says Giridhara Babu, an epide- 
miologist at the Public Health Foundation of 
India in Bengaluru. The network’s gaps are in 
upper-class urban areas where people have 
an incentive to hide their illness, fearing they 
will be ostracized if they test positive, he says. 

The networkis also stronger insome regions 
than others. Some states, such as Kerala, are 
obtaining phone records to investigate the con- 
tact histories of some people with COVID-19, 
says Amar Fettle, an epidemiologist with 
the state’s IDSP in Thiruvananthapuram. In 
Chennai, acity of roughly seven million people 
in the state of Tamil Nadu, health workers are 
going door to door daily to monitor and test 
anyone showing signs of influenza-like illness. 

Babu thinks it’s too early to say whether 
mass contact tracing is working so far, espe- 
cially in some states. Confirmed cases are low 
compared to many other countries, especially 
given India’s huge population. But that could 
bea result of low testing rates, he says. 

For the current strategy to work, it needs 
to capture all infections — a herculean task, 
says Ronojoy Adhikari, a mathematician at the 
University of Cambridge, UK. He co-authored 
a preprint that estimates the virus’s spread in 
India with and without measures such as the 
lockdown (R. Singh and R. Adhikari. Preprint 
at https://arxiv.org/abs/2003.12055; 2020). 
His model, which has not been peer reviewed, 
says that if even 100 infected people escape 
detection and re-enter the population after 
the lockdown, which is highly likely, cases will 
resurge quickly. “That’s really the crux of the 
matter here. How many people can we really 
effectively contact trace?” Adhikari says. 

A long period of social distancing could 
help to keep infection levels manageable for 
the health-care system until a vaccine is devel- 
oped, his model shows. 

But there is a price. Strict social-distancing 
measures mean that people must stay at home, 
so many cannot work — particularly those on 
a daily wage. Developing nations do not have 
much financial flexibility to pay their inhab- 
itants to stay at home for long, says Ricardo 
Hausmann, an economist at Harvard Univer- 
sity in Cambridge, Massachusetts. 

India has announced a 1.7-trillion-rupee 
(US$22.6-billion) stimulus package for the 
poor, but economists have called it modest. 
India has to weigh the deaths that will be 
caused by the loss of livelihoods against those 
from the disease. “For those who have to stay at 
home, they starve to death,” Hausmann says. 
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Feature 


HOW HOT WILL 
EARTH GET BY 2100? 


Critics have challenged some assumptions behind 
global-warming studies. Researchers are now using a 
fresh set of scenarios to model the future of the planet. 


By Jeff Tollefson 


sworld leaders gathered to mark the 
start of 2050, they looked back on 
the coronavirus pandemic 30 years 
before as a turning point in the 
quest to rein in global warming. 
Nations pulled together to defeat 
the pandemic, and that launched a 
new era of cooperation to preventa 
climate disaster. Investments in green energy 
and newtechnology yielded rapid cuts in emis- 
sions of carbon dioxide, putting the world on 
track to limit global warming to around 1.5 °C 
above pre-industrial levels. 

Or maybe not. In 2050, the world could look 
back and see the pandemic as little more than 
ablipinalong and mostly futile effort to stave 
off global warming. Despite atemporary drop 
in carbon emissions from the 2020 outbreak, 
countries turned to cheap fossil fuels to revive 
their economies after the crisis. Carbon 
emissions soared and temperatures followed, 
setting the stage for 5 °C of warming by the end 
of the century. 

These are just two possible visions of the 
future. Nobody knows how the current pan- 
demic will play out; nor is it clear whether 
humanity will ultimately come together 
to avoid a potential climate catastrophe. 
But climate researchers need to explore 
what kinds of problem might emerge with 
different levels of warming. So they have 
developed a suite of scenarios intended to 
represent a range of futures that humanity 
could face’. Their goal is to investigate how 
different policies might alter carbon emis- 
sions — and how the planet will react to all 
of that heat-trapping gas. 

At one end of the spectrum, optimistic 
scenarios explore worlds in which governments 
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join forces to advance low-carbon technologies 
while reducing poverty and inequality. The 
other end sees countries ramp up their use of 
cheap fossil fuels, pursuing economic growth 
at all costs. 

Research teams have been running these 
scenarios through the world’s major climate 
models for the first time, providing projec- 
tions of how Earth might respond to different 
socio-economic pathways. These simulations 
will inform climate research for years to come, 
and will play a central part in the next major 
assessment of global warming by the Intergov- 
ernmental Panel on Climate Change (IPCC), 
which is due out next year. The research could 
also havea key role inthe negotiations around 
anewset of commitments to reduce emissions 
under the 2015 Paris climate agreement. 

These scenarios update a set that has beenin 
use for the past decade, including one extreme 
— and controversial — version that projects 
a temperature increase of around 5 °C above 
pre-industrial levels by 2100. Critics have 
charged that this particular scenario, which 
has had a central role in climate studies for 
more than a decade, is misleading because it 
includes unrealistic amounts of coal use — a 
roughly fivefold increase by 2100. But many 
researchers dismiss that criticism, saying that 
even such high-emissions scenarios have value 
as long as people understand their underlying 
assumptions and limitations. A massive release 
of methane from Arctic permafrost, for exam- 
ple, could have a similar effect to huge surges 
in fossil-fuel use. 

“We're trying to understand risks, not 
predict the future,” says Donald Wuebbles, 
an atmospheric scientist at the Univer- 
sity of Illinois at Urbana-Champaign and a 
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coordinating lead author on the first volume 
of the latest US national climate assess- 
ment’, released in 2017. The scenarios are not 
designed to project emissions, but to inves- 
tigate different levels of warming and types 
of economic development. They help a wide 
variety of researchers: climate modellers use 
them to test their models and project the 
impact of increasing greenhouse-gas emis- 
sions; economists need them to explore the 
costs of policies; and ecologists rely on them 
to predict changes to ecosystems around the 
globe. 

“This is not science fiction,” says Kristie 
Ebi, an environmental-health researcher at 
the University of Washington in Seattle who 
co-chairs the committee that developed the 
new scenarios. “We need these model results 
to give us insights into the impacts of our 
choices, and now we can do that.” 


Unusual business 


In April 1989, a group of experts tasked with 
forecasting potential futures met in Bilthoven, 
the Netherlands, to prepare for the first IPCC 
assessment, which was due out the following 
year. They created scenarios describing how 
much carbon dioxide, methane and other 
heat-trapping gases nations might produce 
over the next century*. And those possible 
future worlds — from the extremely polluted 
to the exceptionally clean — provided the raw 
material for climate modellers to project how 
the planet might react. 

Since then, the IPCC has updated the main 
emissions scenarios several times. But the situ- 
ation changed in 2006, when the IPCC decided 
to get out of the scenario-development 


“We're trying to 
understand risks, 
not predict the future.” 


business because of pressure from the United 
States and others who argued that the organi- 
zation should assess, not guide, science. 

So, in 2010, a self-appointed group led 
by climate scientist Richard Moss, then at 
the Joint Global Change Research Institute 
in College Park, Maryland, published a new 
framework for creating and using scenarios 
designed to guide research for the IPCC’s last 
assessment’, which was released in 2013-14. 

The group provided a set of four pro- 
jections of future carbon pollution levels 
— dubbed Representative Concentration 
Pathways (RCPs) — that could be run by 
climate-modelling groups around the world 
to produce forecasts about the fate of the 
planet®. The RCPs were selected to portray 
different levels of radiative forcing — anum- 
ber that reflects how much extra warming 
results from greenhouse-gas emissions. The 
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RCPs weren't intended to describe particular 
emissions trends or project how economies 
and technology might change. That job was 
left for other researchers, who would later 
produce sets of emissions trends that could 
drive greenhouse-gas concentrations in ways 
that mimic the RCPs. 

Moss says the RCPs were designed to cap- 
ture the spectrum of warming possibilities in 
the scientific literature and create a signifi- 
cant enough range between the high and low 
projections that climate modellers would be 
able to differentiate between them. And one 
major appeal of the scenario with a5 °C global 
temperature increase that elicited so much 
criticism — called RCP8.5 — is that it provides 
modellers with a powerful signal. “We wanted 
to give enough detail so that climate modellers 
could do their work,” says Moss. Regarding the 
individual scenarios, he adds, “we never meant 
to give them any particular weight”. 

Over time, however, the RCPs took ona life 
of their own. Although the caveats and qualifi- 
cations are all there for those who know where 
to look, many scientists and others started 
using RCP8.5 to represent a world without 
aggressive climate action. 

“It’s very tempting to use RCP8.5 for a 
whole range of reasons, but it’s also pretty 
unrealistic,” says Glen Peters, aclimate-policy 
researcher at the Center for International 
Climate Research in Oslo and co-author of 
a recent commentary on the issue’®. “The 
question is how you balance those issues and 
communicate what it represents.” 

The mischaracterization of RCP8.5 — as a 
projection of what could happen in a busi- 
ness-as-usual world in which governments fail 
to enact climate policies — is endemic, says 
Roger Pielke Jr, a science-policy researcher 
at the University of Colorado Boulder. Pielke 
says that even major scientific reviews such 
as the US national climate assessment have 
defaulted to using RCP8.5 as a de facto base- 
line scenario in which emissions continue to 
spike. That inflates projections of the effects 
of global warming — as well as of the costs of 
inaction, he says. 

Wuebbles defends the decision to use 
RCP8.5 in the US assessment”. The document 
refers to RCP8.5 merely as a “higher” scenario. 
It notes that emissions were consistent with 
this scenario for 15-20 years, until they levelled 
off for a few years around 2014. 

Moreover, RCP8.5 provides scientists with 
a high-risk scenario that is valuable for under- 
standing the risks posed by climate extremes, 
says Céline Guivarch, a climate-change econ- 
omist at the Centre for International Research 
onthe Environment and Development (CIRED) 
in Nogent-sur-Marne, France. Many scientists 
argue that even if coal use doesn't rise ina cat- 
astrophic way, 5 °C of warming could occur by 
other means, including thawing permafrost. 

After the RCPs were published in 2010, 
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ARANGE OF FUTURES 


Researchers have developed new scenarios, called 
Shared Socioeconomic Pathways (SSPs), to explore 
different ranges of development and how they would 
alter the climate. These complement older scenarios 
called Representative Concentration Pathways (RCPs). 
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the plan was to have a new set of fleshed-out 
socio-economic scenarios ready within two 
years. Those would have fed into the IPCC 
reports that came out in 2013 and 2014, which 
found that the rate of warming since 1950 is 
unprecedented over a timescale of centuries 
to millennia, and set the stage for the 2015 
Paris climate accord. 

But the process was much more difficult — 
and took a lot longer — than anticipated. The 
new generation of scenarios, known as Shared 
Socioeconomic Pathways (SSPs), were not 
introduced until 2015. Only now, as the major 
climate-modelling centres around the world 
run their experiments for the 2021 IPCC assess- 
ment, are they taking centre stage in climate 
research. 

Although based on the old RCPs, the new 
scenarios for the first time present fully fleshed- 
out narratives about how the world might 
evolve (see ‘A range of futures’). Each provides 
a broad storyline about how the world might 
change, as well as numbers for key demographic 
trends — population, economic productivity, 
urbanization and education — in every country 
on Earth, which modellers then use to simulate 
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emissions and planetary impacts. 

The teams that produced the SSPs 
intentionally left out any climate policies. 
This approach allows scientists to run their 
own experiments and test the impacts of 
different decisions by governments and soci- 
eties, says Ebi. The flexibility allows her and 
other public-health researchers to compare 
and contrast the health benefits from climate 
policies that simultaneously reduce carbon 
emissions and result in cleaner air. 

“You couldn't do that before,” Ebi says. 
“It’s allowing the climate community to ask 
questions that we couldn't ask.” 


Rocky road 


Although the SSP scenarios are only a few 
years old, they were developed in a world 
very different from today’s. They were shaped 
before the political upheaval of 2016, when the 
United Kingdom voted to exit the European 
Union and the United States elected President 
Donald Trump, who promised to put America 
first and withdraw from the Paris climate treaty. 

But the teams that drafted the SSPs 
imagined a storyline that is very close to the 
path that the United States and other major 
powers are taking. The SSP3 scenario, called 
“regional rivalry — a rocky road’, is defined by 
a resurgence of nationalism. It sees concerns 
about economic competitiveness and security 
lead to trade wars. As the decades progress, 
national efforts to lock down energy and food 
supplies short-circuit global development. 
Investments in education and technology 
decline. Curbing greenhouse gases would 
be difficult in such a world, and adapting to 
climate change wouldn't be any easier. Under 
this scenario, the average global temperature 
is projected to soar to more than 4 °C above 
pre-industrial levels. 

For Ebi, it’s a lesson in humility, because 
the scenario seemed outlandish when it was 
developed. But that is the point. 

“When we started working on this, there was 
no discussion of America first, there was no 
Brexit, there weren't trade wars between the 
United States and China,’ she says. “It’s uncom- 
fortable, but you need to have those kinds of 
pathway. We don’t know what the future is 
going to look like.” 


Jeff Tollefson is a reporter for Nature in New 
York. 
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Special report 


MEASLES IS ON THERISE— AND 
COVID-19 COULD MAKE IT WORSE 


Measles has killed thousands in the Democratic Republic 

of the Congo and is surging worldwide. But more than 

20 countries have suspended vaccination campaigns to cope 
with the new coronavirus threat. By Leslie Roberts 


A driver prepares to transport measles vaccines in the Democratic Republic of the Congo. 


viral outbreak has killed more than 
6,500 children in the Democratic 
Republic of the Congo (DRC) andis 
still spreading through the country. 
The foe isn’t the feared coronavirus, 
which has only recently reached the 
DRC. It’s an old, familiar and under- 
estimated adversary: measles. 

Cases began to spike here in October 2018. 
Children became weak, feverish and con- 
gested, with red eyes and painful sores in their 
mouths, all with the telltale rash of measles. 
“We have been running after the virus ever 
since,” says Balcha Masresha, an epidemi- 
ologist with the World Health Organization 
(WHO) regional Africa office in Brazzaville 
in the neighbouring Republic of Congo. The 
situation has mushroomed into what WHO 
experts say might be the largest documented 
measles outbreak in one country since the 
world gained a measles vaccine in 1963 (see 
‘Measles cases on the rise’). 

Despite the vaccine, the highly contagious 
measles virus continues to spread around the 
globe. In 2018, cases surged to an estimated 
10 million worldwide, with 140,000 deaths, 
a 58% increase since 2016. In rich countries, 


scattered measles outbreaks are fuelled by 
people refusing to vaccinate their children. 
Butin poor countries, the problems are health 
systems so broken and underfunded that it 
is almost impossible to deliver the vaccine 
to people who need it. The DRC’s flood of 
cases shows why measles will keep flaring up 
despite efforts to control it. And the situation 
will worsen with the COVID-19 pandemic: 24 
countries had suspended measles vaccination 
campaigns by 14 April as health-care workers 
scrambled to deal with the coronavirus. 


An overlooked killer 


In poor countries, measles is a killer, espe- 
cially when combined with malnutrition and 
vitamin A deficiency. Estimates are uncertain, 
but the death rate in developing countries 
hovers around 3-6%, and it can spike as high 
as 30%, the WHO says. Its victims often die of 
complications including pneumonia or diar- 
rhoea and dehydration. Others can be left with 
permanent disabilities, including blindness, 
hearing loss and brain damage. The virus also 
impairs theimmune system for months or years 
after infection, creating ‘immune amnesia’ that 
leaves children vulnerable to other infections. 
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The virus is so contagious that few 
unvaccinated people who come into contact 
with it are spared its effects. Scientists define 
infectiousness using the ‘reproduction num- 
ber’ — how many people, on average, would be 
infected by a single person with the virus, ina 
population that has no immunity. For Ebola, 
that number is estimated at 1.5-2.5. The new 
coronavirus terrifying the world seems to be 
somewhere between 2 and 3. Measles tops the 
charts with a reproduction number of 12-18, 
which makes it the most contagious virus 
known. You don’t need to beinthe same room 
as an infected person to catch the virus — it is 
spread by respiratory droplets that can linger 
in the air for hours. 

Two doses of asafe and effective vaccine can 
prevent measles. Many children in poor coun- 
tries are lucky to get a single dose, however, 
which doesn’t always lead to full protection in 
all who receive it. Because the virus is so conta- 
gious, 92-95% of a population needs to be fully 
immunized to ward off outbreaks. Inthe DRC, 
only 57% of children received even one dose of 
measles vaccine in 2018, according to a Unicef 
study (go.nature.com/3cfjalh), creating ideal 
conditions for the virus to explode. 


Failure modes 


In other countries, too, measles simmers at 
low levels until the number of children suscep- 
tible to the virus builds up and it takes off. In 
eachcountry, aslightly different mix of factors 
leads to an outbreak. 

In Ukraine, after a child died of unrelated 
causes following a measles jab in 2008, 
vaccination coverage plummeted from 95% 
that year to 31% in 2016, says Robb Linkins, a 
measles specialist in the global immunization 
division at the US Centers for Disease Control 
and Prevention (CDC) in Atlanta, Georgia. He 
says no one was surprised when, in 2017, a 
huge outbreak hit that has led to more than 
115,000 cases. 

In Madagascar, a shortage of measles 
vaccine helped fuel an outbreak that has swept 
the island nation starting in 2018, causing 
more than 240,000 cases and 1,000 deaths. 

The DRC has a number of difficulties. The 
country has sucha high birth rate — 3.5 mil- 
lion children born each year — that it needs to 
conduct mass vaccination campaigns every 
two years. Those campaigns, in which tens of 
thousands of health workers fan out across this 
vast country, are a logistical nightmare. The 
vaccine must travel from the capital, Kinshasa, 
toremote villages that can be reached only by 
helicopter — or through bloody conflict zones. 

The vaccine must be kept at between 2 °C 
and 8 °C from the time it leaves the warehouse 
until it is administered — a challenge inatrop- 
ical environment where power cuts are fre- 
quent. Health workers must be trained to inject 
it safely. The vaccine comes as a powder, which 
must be reconstituted with sterile diluent and 
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Special report 


MEASLES CASES ON THE RISE 


Reported measles cases are estimated to be at a 20-year high. 
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LARGE MEASLES OUTBREAKS 


The epidemic in the Democratic Republic of the Congo is the largest 


single-nation outbreak for decades. 
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348,158 cases* 
6,504 deaths 


Madagascar (2018-20) 
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*Suspected, not yet officially reported to WHO. 


used within 6 hours. It also comes in ten-dose 
vials; worried about wastage, vaccinators are 
sometimes hesitant to open one when just a 
few children show up to asession, so children 
go unimmunized. Clinics also have to be open 
when parents can make it, and vaccinators 
have to be paid or they won't come, either —a 
problem inacountry plagued with corruption. 

The DRC has also been battling Ebola, out- 
breaks of cholera and yellow fever — and now 
the coronavirus. Measles often takes lower 
priority. In addition, the DRC “is confronted not 
only with political challenges but a long-run- 
ning civil war”, says Katrina Kretsinger, a 
medical epidemiologist and global measles 
expert at the WHO in Geneva, Switzerland. 

Money is a major problem. Vaccination 
campaigns cost around US$1.80 per childinthe 
DRC, says Masresha; international donors foot 
only part of the bill. In 2010, the DRC couldn't 
muster enough funds and cancelled a sched- 
uled campaign. An outbreak that hit at the end 
of the year raged for more than 30 months. Fur- 
ther campaigns in 2013-14 and 2016-17 didn’t 
reach enough children. InJune 2019, after cases 
soared to more than 3,500 a week at the start 
of the year, the DRC government declared an 
epidemic, opening the door to further interna- 
tional aid. By the end of the year, 18.5 million 
children had been vaccinated. 

The WHO estimates that there have been 
more than 348,000 cases and 6,500 deaths, 
but Francisco Luquero, an epidemiologist at 
Epicentre, the research arm of Médecins Sans 
Frontiéres (MSF, also called Doctors Without 
Borders) in Paris, thinks the outbreak is much 
worse. The case count reflects only people 
who goto health centres, he says; many don’t 
in the DRC. As for the mortality estimates, 
“they count deaths that happen right after 
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a measles case. They should look out for the 
next five years,” he says, because of immune 
amnesia. “The outbreak will have a profound 
impact on public health.” 


Eradication hopes 


Steve Cochi, a paediatrician and senior adviser 
tothe CDC’s global-immunization division, is 
especially frustrated by measles’s global toll 
because, froma biological and technical stand- 
point, he says, the disease could be eradicated. 
Unlike Ebola, yellow fever or (probably) the 
new coronavirus, it has no animal host, anda 
cheap and effective vaccine exists. 
Anewvaccine delivery system being devel- 
oped by two teams — one a collaboration 
between the CDC and the Georgia Institute 
of Technology and Micron Biomedical, both 
in Atlanta, and the other at Vaxxas, a biotech 
company based in Sydney, Australia — could 
bea “game changer” for measles control, says 
Cochi. The vaccine uses a microarray patch, 
which looks like a small, round bandage with 
hundreds of microneedles, each carrying a 
small amount of live, freeze-dried vaccine. It 
delivers vaccine under the skin in 5 minutes. 
“Itis very thermostable, takes up little space, 
doesn’t have to be reconstituted, and you don’t 
have to worry about safety,’ Cochi says. But the 
measles patch has languished for lack of fund- 
ing, he says, and has yet to reach clinical trials. 
Until polio is eradicated, the world does not 
have the appetite or money to target another 
disease for extinction, Cochi says. In 2010, 
the WHO’s key Strategic Advisory Group of 
Experts on Immunization (SAGE) declared 
that measles canand should be eradicated, but 
didn’t recommend a target date. Since then, 
advocates have been lobbying the WHO to 
launcha global measles-eradication campaign 
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and set a date for completion, as it did for 
smallpox and polio. Ata meeting last October, 
however, SAGE recommended a different tack: 
waiting until success is in sight — say, five years 
away — before pushing full-bore to wipe out 
the disease. Doing so would require boosting 
rates of routine immunization with two doses 
of measles vaccine to a level never achieved 
before. The DRC is one of about 20 countries 
that have yet to add the second dose to their 
regimes. Eradication would also depend on 
improving the quality of mass campaigns and 
bringing an improved vaccine into use. 


Coronavirus crunch 


The coronavirus pandemic has dealt 
measles-control efforts another huge blow. 
On 26 March, SAGE recommended that coun- 
tries suspend all preventive mass-vaccination 
campaigns, including for measles. Already, 
37 countries have suspended scheduled mea- 
sles campaigns or will soon doso, says Linkins. 
This means that 117 million children may not 
be vaccinated as planned, he says. The DRC, 
however, is continuing its outbreak response. 
“We must protect vulnerable populations 
from the spread of COVID-19,” Linkins says, 
but limiting preventive measles immunization 
will create “dangerous immunity gaps”. Coun- 
tries must be able to resume their campaigns 
quickly after the pandemic subsides, he adds. 
With campaigns cancelled and global 
measles-immunization rates for just one dose 
of vaccine stalled at 86%, the unrelenting cycle 
of outbreaks will continue. This DRC outbreak 
will subside, but Masresha says it will be a 
“temporary victory”: the virus will rebound. 


Leslie Roberts is a science reporter in 
Washington DC. 


SOURCE: WHO/UNICEF 
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Science in culture 


Books & arts 


Patients at Milledgeville State Hospital, Georgia, in 1951. 


Psychiatry under the 


shadow of white supremacy 


From the start, racism has shaped the care of people 
with mental illness in the United States. By Mical Raz 


owdoesaculturethat enslaved people, 
encouraged lynching and developed 

racial segregation decide who is and 

is not sane? That is the question that 

frames Mab Segrest’s book on the 

legacy of slavery for US psychiatry in general 
and for what was in the 1940s and 1950s one of 
the largest psychiatric hospitals in the world. 
Combining archival research with 
fictionalized scenes, Segrest, a feminist and 
anti-racist scholar, recounts more than acen- 
tury of custodial care at Georgia’s infamous 
Milledgeville hospital for people with mental 
illness. It opened in 1842 and Segrest weaves 
its history with the wider trajectory of US psy- 
chiatric care, the ravages of the American Civil 

War (1861-65) and the many manifestations of 


white supremacy and violence against women. 

The book is organized chronologically, 
but includes multiple forays into the pres- 
ent, which can be distracting. Segrest begins 
her account by reconstructing a period when 
slavery was omnipresent and the asylum 
took in only white patients. After the war, 


Administrations 

of Lunacy: Racism 
and the Haunting of 
American Psychiatry 
at the Milledgeville 
Asylum 

x Mab Segrest 

The New Press (2020) 


Administrations 
of Lunacy 


MAB SEGREST : 
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psychiatry followed a racist trajectory that was 
by no means inevitable, argues Segrest. When 
Milledgeville started taking black patients in 
1867, it — like other asylums nationwide — 
adopted racial segregation. 

As part of their treatment, white men 
worked as gardeners; black men had to labour 
on the institution’s farm. White women were 
seamstresses; women of colour worked inthe 
laundry. Segrest uses the asylum’s archive to 
show that luxuries such as writing supplies, 
slippers, soap and carpets were allocated 
much more generously to white patients, 
whereas black patients faced daily discrim- 
ination and neglect. Many died soon after 
arrival, reflecting both their poor health and 
the deplorable conditions they had to endure. 
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Segrest also highlights what was left 
unexplored. Rather than asking how slavery 
might devastate an individual’s psyche, phy- 
sicians treating newly freed African Americans 
discussed how their mental health might have 
been harmed by emancipation. What’s more, 
these patients often came from counties in 
which extreme racial violence, including 
“whippings, assaults, and murders’, was rou- 
tine. Yet, in many cases, this history remained 
undocumented. Asylum psychiatry “main- 
tained a vast silence about the bloodbath all 
around it”, writes Segrest, just as it had previ- 
ously been silent about the violence of slavery. 


Lingering effects 

Milledgeville underwent several name changes 
and ultimately became the Central State Hos- 
pital before the main building closed in 2010. 
In her final chapters, Segrest examines how, 
when such hospitals began to close in the 
1980s, penal institutions took their place. 
As welfare programmes were starved, the 
US prison population spiked, with people of 
colour and people with mental illness dispro- 
portionately incarcerated. Today, 90% of US 
psychiatric-care beds are in jails and prisons. 
Psychiatry will not be able to escape “the after- 
life of slavery”, she argues, until it confronts its 
culpability in mass incarceration. 

A newcomer to the history of psychiatry, 
Segrest’s approach is fresh and creative. She 
uses her imagination to flesh out the realities of 
life within the asylum walls. Describing Frances 
Edwards, amother of seven taken to Milledge- 
ville in 1856, Segrest imagines her arms feeling 
weirdly light and empty without her children, 
as her breasts “ached and leaked”. Segrest also 
finds connections between topics not always 
identified as part of psychiatry’s past. She calls 
attention to the high rates of infant mortality 
in the black community, exploring how such 
factors might have shaped — and still shape — 
black women’s mental health. 

Segrest’s is one of several books in the past 
few years that have foregrounded discussions 
of race in the history of psychiatry and of asy- 
lums. Her impressionistic style and convoluted 
structure contrast sharply with the more rigor- 
ous work of historians such as Martin Summers 
in his 2019 Madness in the City of Magnificent 
Intentions and Wendy Gonaver in The Peculiar 
Institution and the Making of Modern Psychia- 
try, 1840-1880 (2019). Segrest’s mixture of fact 
and fiction can also be confusing. 

But what is lost in clarity is perhaps gained 
in popular appeal. Uncomfortable reading at 
times, this valuable book helps to show how 
white supremacy shaped the definition and 
care of people with mental illness from the 
start, and how psychiatry remains in its shadow. 


Mical Raz is a physician and historian of health 


policy at the University of Rochester, New York. 
e-mail: micalraz@rochester.edu 
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Preppers, bunkers and 
emaciated polar bears 


How to live in the face of death — Mark O’Connell’s 
personal journey. By Caspar Henderson 


re we facing the end of civilization, 

or even the planet? It’s a ques- 

tion that attracted some serious 

scientific firepower even before the 

current pandemic. UK institutions 
such as the Centre for Existential Risk at the 
University of Cambridge and the Future of 
Humanity Institute at the University of Oxford 
are modelling the probabilities of various 
catastrophes, froma giant meteorite strike to 
ascenario in which criminals and psychopaths 
gain ‘easy nukes’ and incinerate a vulnerable 
world. Meanwhile, climate and Earth-systems 
scientists are amassing more evidence by 
the month that, barring rapid and profound 
reorganizationin our societies, climate change 
will batter our world on at least the scale ofa 
major war. 

Rather than assessing the science itself, 
Notes From An Apocalypse explores how such 
threats affect individuals. Written before the 
COVID-19 crisis, it is an eerily prescient mix 
of confession, political critique, meditation 
and comic monologue on living in the face of 
death. It is the second such book from Mark 
O’Connell, the winner of the 2018 Wellcome 
Book Prize (for his first, To Be A Machine, which 


tackled the philosophy that humanity can 
evolve beyond its limitations using science). 
As the scientific and political responses (or 
lack thereof) to threats ranging from global 
heating to plastic dominate the headlines, 
O’Connell probes deeper into our personal 
psyches. In atone somewhere between those 
of writer Samuel Beckett, film-maker Woody 
Allen and poet W. B. Yeats, he asks what hap- 
pens when we're faced with the prospect of 
both individual and global demise. 

A successful literary journalist living in 
Dublin with his young family, O’Connell is 
obsessed with doom. He sets his computer 
home page to an online forum dedicated 
to discussing civilizational collapse, and 
compulsively checks his smartphone for 
YouTube clips of emaciated polar bears, when 
he should instead be watching cartoons with 
his son. 

This fixation leads him into the shadowy 
worlds of ‘preppers’. These self-styled surviv- 
alists stockpile stores and weapons, readying 
themselves for civilization’s impending 
collapse, and feed endless online discussions 
about videos of the contents of their ‘bug- 
out bags’ — knapsacks containing items they 


as 


This converted nuclear missile vault in Glasco, Kansas, has a heated pool and water slide. 
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Many former military bunkers, such as this one near Edgemont, South Dakota, are being repurposed into doomsday communities. 


consider essential for the end-time. The clips 
strike O’Connell as apocalyptic variations of 
‘haul videos’, in which young consumers lay 
out the treasures of a recent shopping binge, 
with Kevlar socks in place of Superga shoes, 
and athleisure swapped for military-grade 
cordage. 

O'Connell hits the road, deploying his 
considerable gonzo journalism skills to seek 
out other doomsday obsessives, each caught 
up in their own dark, imagined futures. He 
visits high-end condos being built ina former 
weapons-storage facility in South Dakota that 
can withstand explosions of up to half a meg- 
atonne. He seeks out the luxury bolthole of 
Peter Thiel, billionaire entrepreneur and ‘sov- 
ereign individual’ — a person who controls vast 
resources and intends to redesign the govern- 
ment to suit their needs after collapse. Thiel 
is one of several Silicon Valley elites who have 
chosen to build their bunkers in New Zea- 
land. And O’Connell attends a meeting of the 
Mars Society in Pasadena, California, where 
enthusiasts share dreams of a new, Ameri- 
can-style frontier in the unspeakably harsh 
conditions on the red planet. 

In each case, O’Connell skewers what he 
sees as acentral psychopathology or distorted 
value system, even as he acknowledges his 
own uneasy fascination and near-complicity. 


Preppers, he argues, are readying not for 
their fears but their fantasies; they contribute 
“nothing to the prevention or alleviation of suf- 
feringin others”. Thiel and the would-be Mars 
colonizers imagine a world somehow beyond 
politics — and taxes — that is almost exclusively 
white and male. 

The book’s final two journeys explore a 
quieter response to the threat of catastrophe. 
O’Connell retreats to the Scottish Highlands 
with members of the Dark Mountain move- 
ment — artistic and mostly gentle souls who, 
knowing climate chaos is imminent, seek 
solace in reconnection with nature. Lastly, he 
takes part in an anti-stag party with a friend 
whois separating from his wife. They visit the 
Chernobyl Exclusion Zone in Ukraine, where 
a future imagined by the Soviets met its end. 
O’Connell is struck by the shards of “our own 
machine age” — shattered glass from broken 


Notes from an 
Apocalypse: A 
Personal Journey to 
the End of the World 
and Back 

Mark O'Connell 
Doubleday (2020) 


MARK O'CONNELL 


© 2020 Springer Nature Limited. All rights reserved. 


screens and a heap of old television sets with 
“ancient circuit boards greened with algae”. 
He wonders if this might be a glimpse of our 
own future. 

Back in Dublin, O’Connell finds he has lost 
his taste for cosmic nihilism. In the radiance, 
joy and hilarity of his kids — the way in which 
they connect with, rather than retreat from, 
the world — he finds inspiration to shift his 
focus from how our lives might end, to what 
makes them worth living. 

Notes from an Apocalypse offers no sci- 
entific analysis of the existential threats we 
face or how we should respond. Instead, it 
illuminates the anxieties and delusions we 
share and oversights we commit, and shows 
how easily our fears (particularly when ena- 
bled by power, money and technology) can 
cause us to walk away from the disasters we 
create — to hide, flee, stockpile — just when we 
most need to engage. In this reflective, hilar- 
ious and disturbing page-turner, O’Connell 
makes a compelling case that connecting with 
nature and each other is the best way to calm 
our apocalyptic dread — and it might even 
increase our prospects of avoiding the worst. 


Caspar Henderson is a writer and journalist in 


Oxford, UK. 
e-mail: caspar81@gmail.com 
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A petrol station in Berlin. On average, drivers accurately judge fuel costs but severely underestimate all other expenditure. 


Runningacarcostsmuchmorethan people 
think — stalling the uptake of green travel 


Mark A. Andor, Andreas Gerster, Kenneth T. Gillingham and Marco Horvath 


Car owners underestimate 
total vehicle costs. Giving 
consumers this information 
could encourage the switch 
to cleaner transport and 
reduce emissions. 


rivate cars are responsible for about 

11% of the world’s total carbon dioxide 

emissions. That’s the greatest share in 

the transport sector, which accounts 

for 24% of emissions overall’. Petrol 

and diesel cars are associated with many 

other harmful effects, such as air pollution, 

congestion and accidents. It is clear that these 

cars must be largely removed from the roads 
to achieve sustainable mobility. 

The good news is that some policies have 


been enacted to reduce greenhouse-gas 
emissions and air pollution from petrol and 
diesel cars. Some markets have tightened 
emissions limits. In the European Union from 
2021 onwards, for instance, the fleet-wide 
average emission target for new cars will be 
reduced by more than 25% — from 130 grams 
of CO, per kilometre (g CO, km‘) in 2015-19 to 
95¢CO, km". The current average is 120 g CO, 
km (on the basis of 2018 data; see go.nature. 
com/39puqyy). And the United Kingdom will 
stop sales of new petrol, diesel and hybrid cars 
from 2035 onwards. 

Cities are taking action, too. For example, 
Oslo has decreased the number of parking 
spots and raised parking fees”. New York City 
and Shenzhen in Chinaare electrifying their bus 
fleets, and Londonis reducing bus emissions 
and improving infrastructure for walking and 
biking. Stuttgart inGermanyis among the cities 
banning older, polluting diesel cars. 

The bad news is that more than 99% of new 
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passenger cars sold worldwide still rely on 
fossil fuels'’, and overall vehicle ownership 
in Europe grew by 25% between 2000 and 2017 
(ref. 4). The continued demand for vehicle 
ownership stems from several factors, includ- 
ing increased income* and more mobility as 
people travel farther to their jobs as a result of 
greater city sprawl®. The transition away from 
conventional vehicles is hindered by the high 
upfront costs of electric cars’ ° and too few 
charging stations, leading to ‘range anxiety’ 
from potential owners”. 

Consumers decide whether to owna vehicle 
on the basis of considerations such as where 
they live and the vehicle’s upfront and lifetime 
costs”. If they systematically underestimate 
total costs, this could increase car ownership 
and its associated emissions. It could also 
make alternative forms of transport — car shar- 
ing, alternative-fuel vehicles, public transport, 
biking or walking, say — seem less attractive. 

Wesurveyed morethan 6,000 citizens across 
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Germany to investigate whether consumers 
grasp the total cost of car ownership. We also 
performed a simple analysis to explore the 
potential implications of this awareness on 
the number of cars on the road. 

We find that people underestimate the 
total cost of owning a car by about 50%. We 
also found that providing personalized 
information on the costs of car ownership 
increased respondents’ willingness to pay for 
a public-transport ticket by around 22% (see 
Supplementary information; SI). We estimate 
that educating people in Germany about the 
true cost could reduce car ownership by up to 
37% and cut associated transport emissions by 
23%. Here, we suggest labelling and commu- 
nication policies that could help to speed the 
transition to cleaner transport. 


Data set and methods 


We conducted asurvey of the heads of German 
households — the people who self-report as 
being responsible for financial decisions — 
between 23 Apriland12June 2018. For every car 
owner, the survey elicited responses onthe cost 
of ownership, based on the individual's car type 
and driving behaviour, as well as socio-economic 
characteristics such as income, number of 
children and education. The work was done in 
collaboration with the survey institute Forsa 
in Berlin. It used a random sample of Forsa’s 
household panel, whichis representative of the 
German-speaking population aged 14 and older. 

Of the 7,823 individuals who started the 
survey, 6,812 completed it, 6,233 of whom own 
a car (92%). Of these, 5,483 stated what they 
thought their monthly car costs were. These 
respondents form the basis for our analyses. Tak- 
ing households’ car type and travel behaviour 
into account, we use detailed information from 
the German Automobile Club (ADAC) and other 
sources to calculate the actual monthly costs of 
car ownership, onaverage, for depreciation, fuel, 
taxes and insurance, and repair (see SI). 


Striking difference 


Our findings were striking. Consumers 
underestimate the total cost of vehicle 
ownership by €221 (US$240) per month on 
average. The misjudgement amounts to 52% of 
the actual costs, so the total cost is nearly twice 
what people think. Using only the respondents 
who provided an estimate for all cost factors, 
the underestimation is €161 on average, which 
is 35% of the actual costs. To be conservative, 
we proceed in our analysis using this sample 
(for the full sample, see SI). 

The difference between estimates and actual 
costs varies widely. This is not necessarily 
surprising. The mean and median of the differ- 
ence is well below zero, clearly demonstrating 
that costs are underestimated on average. 

Wealsoinvestigated the four main costs of car 
ownership: fuel, depreciation, repair, and tax 
and insurance. On average, respondents came 
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COSTLY MISJUDGEMENT 


Car owners in Germany severely underestimated 
major forms of expenditure, except on fuel. 
Knowledge about true total costs could result in 37% 
fewer cars on German roads, an analysis finds. New 
transport policies should make such information 
available at the point of sale. 
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very close to perfectly estimating how much 
they spent on fuel, consistent with the previous 
literature”. But they severely underestimated 
all other major expenditure for running their 
cars (see ‘Costly misjudgement’). To our knowI- 
edge, this misjudgement has not been reported 
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previously, and it can provide leverage points 
for designing new transport policies. 


Fewer cars 


Let’s assume that it is possible to completely 
eliminate the degree to which people system- 
atically underestimate the total cost of owning 
acar. Would this change the number of cars on 
the road? We explored this question by mod- 
elling how car ownership changes when the 
associated costs change, on the basis of previ- 
ous work on US car ownership”. This allowed 
us to calculate the reductionin the 47.1 million 
passenger cars in Germany that one might 
expect if households were perfectly informed 
about running costs. 

We predict that being aware of the true 
cost of owning a car could result in almost 
17.6 million (37%) fewer vehicles on the road 
in Germany. Such a drastic reduction would 
mean less congestion and cleaner air. It would 
also lead to a drop in CO, emissions of about 
37 million tonnes per year: 4.3% of Germany’s 
total, or 23% of emissions from its transporta- 
tion sector (see SI). 

Although increased demand for publictrans- 
port could lead to more CO, emissions from bus 
and rail travel, this effect would probably be 
small. First, the carbon emissions per person 
for each kilometre travelled for these modes 
of transport are around half those of car travel 
(see go.nature.com/2upeejh). Second, emis- 
sions-trading schemes in the EU, for example, 
mean that the increased electricity consumed 
by the growing number of electric trains and 
vehicles is prevented from translating into extra 
carbon emissions for the economy, because 
total emissions are capped. 

Even amore cautious estimate of changes in 
car ownership in response to higher prices of 
new vehicles (as reported in a 2018 US impact 
analysis“) would still imply a 9% reduction in 
car ownership, taking more than 4 million cars 
off German roads (see SI). 

We further used our survey data and empir- 
ical estimates of the impact of car-ownership 
costs to investigate whether such misjudge- 
ments affect the use of public transport and 
electric vehicles. We predict an increase in 
demand for bus and rail travel of 8% and 12%, 
respectively. Purchases of electric vehicles 
could increase by about 73% (see SI). 

Although our survey was conducted in 
Germany, we expect the results to be applicable 
throughout Europe, and probably to countries 
with similar economies elsewhere. In 2017, Ger- 
many had 561 passenger cars for every 1,000 
inhabitants; the EU average in the same year 
was 512 (ref. 15). And people in other countries 
accurately estimate fuel costs”, asin our survey. 

These are preliminary findings. Our cal- 
culations require a number of assumptions. 
For example, we had to assume how much 
car ownership changes when household per- 
ceptions of total costs change, and how much 
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the demand for other transport modes would 
respond to decreased car ownership (see SI, 
pages 5-6). To explain, consider the case in 
which the changes in car ownership and the 
demand for other transport modes were less 
pronounced. Then, eliminating cost beliefs 
would have smaller effects. 

Critics might argue that cost is merely one 
of many factors that influence individuals’ 
decisions to own a combustion-engine car, 
including status, the need for mobility in 
rural areas and the lack of infrastructure for 
electric-vehicle charging and for public trans- 
port. Although cost is indeed only one factor, 
it is a crucial one in the car-ownership deci- 
sion””’*, Another potential criticism is that our 
consideration of vehicle list prices does not 
account for the discounts car buyers usually 
negotiate, which could partly explain why our 
respondents underestimated depreciation. 

We show that taking typical discounts into 
account does not change our main conclusions 
(see SI). Indeed, our approachis conservative 
because we do not consider other factors that 
increase the cost of owning acar, such as extra 
equipment (navigation systems, seat heating 
and sports seats) or premiums for leasing or 
financing. These raise car prices by 30-50% 
on average, and so widen our estimates of 
ownership-cost misjudgement. 


Policy action 


It is unlikely that anything will entirely stop 
people from underestimating the total cost of 
owning acar. Nevertheless, we think that closing 
this ‘awareness gap’ can spur thetransition away 
from conventional cars. How do we do this? 

Cars should be labelled with total costs at 
the point of sale and in registration letters. 
Such information-provision policies influence 
consumer purchasing behaviour ina variety of 
contexts, from buying property to durables 
suchas refrigerators and air conditioning” ”. 
Many countries, including the United States, 
Japan and China, already mandate that new 
cars for sale are labelled with the average future 
fuel cost of driving them. 

Companies that promote alternative forms 
of transport with lower emissions — such as 
electric-vehicle dealers, car-sharing or pub- 
lic-transport firms — could boost business 
by including information on the cost of car 
ownership in their advertising. To prevent 
potential conflicts of interest, the information 
would need tobe certified or come from trusted 
sources, suchas scientific institutions or public 
ministries. But even a general marketing cam- 
paign could at least encourage consumers to 
calculate the cost of driving accurately. 

How successful might such interventions 
be, compared with other options such as a 
fuel tax or subsidizing public transport? We 
calculate that fuel prices would need to rise by 
a massive 1,242% to cut car ownership by the 
same 37% reduction that we predict as a result 


of improved consumer information (see SI). 
This is because fuel price changes largely tar- 
get driving itself, rather than the decision of 
whether to buy a vehicle. 

Another widely discussed policy is to sub- 
sidize public transport more. This might 
have less potential than correcting the mis- 
judgement of car ownership costs. There is 
no evidence available solely from Germany, 
so we based our extrapolations on evidence 
for the relationship between prices for public 
transport and car ownership. This came froma 
meta study of 83 papers, predominantly from 
Europe and the United States”°. We find that 
eliminating public-transport fares entirely 
would decrease car use by only 4.1-6.2%, and 
could have an even smaller impact on car own- 
ership. This approach would also be burden- 
some on the public treasury. In Germany, for 
example, total ticket sales by local transport 
companies amounted to €13.3 billion in 2019. 

We believe that policies on labelling and 
information would be less costly and less polit- 
ically fraught than would many other options. 
The primary resistance to sucha policy might 


“Fuel price changes largely 
target driving itself, rather 
than the decision of whether 
to buy avehicle.” 


come from conventional car dealers and man- 
ufacturers who would be reluctant to see sales 
fall. But information provision is likely to get 
strong public support from consumer-protec- 
tion agencies, for example, which could offset 
such lobbying. Furthermore, providing owner- 
ship cost information could be implemented 
by revising existing fuel labels, which would 
reduce institutional obstacles. 


Next steps 


Future research should focus on total 
car-ownership costs, rather than on fuel costs 
alone. To plug knowledge gaps, we have the 
following recommendations. 

We need to know why consumers underesti- 
mate the costs of car ownership. This could lead 
to information targeted to those who make the 
biggest misjudgements. Campaigns should be 
tested in the field to guide policymakers who 
are aiming to promote greener transport. 

Our survey should be replicated in other 
countries to clarify how and where the results 
apply. This could shed light on what drives the 
systematic underestimation. Furthermore, 
future studies should elicit how the underesti- 
mation can be reduced, for instance by means 
ofsurveys, laboratory experiments and, ideally, 
field experiments. It would also be useful to 
investigate the overall impact of car-owner- 
ship information on mobility behaviour more 
broadly, including the use of publictransport, 
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cycling and walking. Our analysis uncovered 
aneed for further empirical evidence on the 
relationship between the cost of car ownership 
and the number of cars on the roads. 

We see great promise inthis research agenda 
to inform policymakers about cost-effective 
approaches to reducing emissions from trans- 
portation. One of the goals of the European 
Green Deal, proposed by the European Com- 
mission last December, is to accelerate the 
shift to sustainable and smart mobility. And 
Horizon Europe, the EU’s €100-billion research 
programme, is currently defining its agenda 
for research during 2021-27. Both present an 
invaluable window of opportunity. 
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Readers respond 


Correspondence 


Use newfound trust 
in science wisely 


The current COVID-19 pandemic 
calls for a renewed public 

trust in science — for better 

or worse. We urge the global 
scientific community to seize 
this opportunity to build on that 
trust. 

Three months into the 
pandemic, we issued a 
questionnaire to a panel 
of 337 US residents who 
represented a cross-section 
of the general public. Our aim 
was to find out how their trust 
had changed from before the 
pandemic (data collected 
in mid-August 2019). Those 
reporting “a lot of trust” inthe 
federal government remained 
at an abysmal 1%, whereas 
“strong trust” in science jumped 
from 41% to 48%. We found 
that trust in science was the 
most important predictor of 
compliance with public-health 
recommendations for limiting 
viral spread. 

With great trust comes 
great responsibility. As we 
ramp up research to meet the 
public’s need for solutions, 
we must be especially careful 
to communicate transparent 
information about our 
capabilities, uncertainties, 
disagreements or agreements 
(see S. van der Linden et al. 
Nature Hum. Behav. 2, 2-3; 
2018). 

Competence and warmth 
are judged by psychologists to 
be crucial for trustworthiness. 
Although scientists rate 
highly on competence, they 
can sometimes come over as 
dispassionate (see G. Cardew 
Nature 578, 9; 2020). Now, 
more than ever, we must show 
our commitment to humility, 
honesty and the public good. 


Patricia Andrews Fearon, 
Friedrich M. Gétz, David Good 
University of Cambridge, UK. 
pba21@cam.ac.uk 


Better lives call for 
more than insight 


Hetan Shah argues that global 
problems need social science 
to help solve them (Nature 577, 
295; 2020). I contend that he 

is both right and profoundly 
wrong. 

Developing social inquiry as 
a social ‘science’ is a blunder 
that goes all the way back 
to the eighteenth-century 
Enlightenment (see go.nature. 
com/34exatc). To promote 
human welfare, academia needs 
to provide practical solutions to 
problems of suffering, poverty, 
injustice and avoidable death. 
It needs to articulate and assess 
possible solutions in terms 
of actions, policies, political 
programmes, philosophies of 
life and ways of living. 

The task of social inquiry 
and the humanities is to guide 
people on howto resolve such 
issues and conflicts in effective, 
intelligent, humane ways. In 
connection with the climate 
crisis, for example, the public 
needs to know precisely what 
must be done by governments, 
businesses, the media, public 
institutions and individuals to 
mitigate global warming. 

However, social scientists 
down the decades have fallen 
short in providing such 
guidance. In my view, this is 
because their focus has been 
on acquiring knowledge about 
society when it should instead 
be on promoting social progress 
towards as good a world as 
possible. 


Nicholas Maxwell University 
College London, UK. 
nicholas.maxwell@ucl.ac.uk 
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Climate: managing 
deep uncertainty 


In our view, Zeke Hausfather and 
Glen Peters’s recommendation 
to assign a single set of best- 
estimate probabilities to all 
future emissions scenarios 

as a means to assess climate- 
change risks (Nature 577, 
618-620; 2020) could give 
decision-makers a false sense 

of certainty, leading to costly 
adjustments if the world evolves 
in unanticipated ways. 

The Society for Decision 
Making Under Deep Uncertainty 
(www.deepuncertainty.org), to 
which we belong, offers a better 
strategy. It relies on methods 
that focus on the implications 
of alternative scenarios and the 
extent to which response tactics 
are shared across a wide range of 
scenarios. This helps to manage 
uncertainties — for example, 
in sea-level rise after 2050 — by 
identifying long-term options 
and short-term, flexible actions 
that can prepare for a range of 
future emissions. 

Bypassing the need to assign 
probabilities enables decision- 
makers to better understand the 
combination of uncertainties 
that most affect their choices, 
thereby reducing locked-in 
choices and decision delays that 
can arise when using a single 
scenario. 


Judy Lawrence Victoria University 
of Wellington, New Zealand. 
judy.lawrence@vuw.ac.nz 


Marjolijn Haasnoot Deltares and 
University of Utrecht, Utrecht, 
the Netherlands. 


Robert Lempert RAND 
Corporation, Santa Monica, 
California, USA. 
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Climate: why use 
2100 for timeline? 


Zeke Hausfather and Glen 
Peters’s discussion of future 
climate scenarios focuses on 
what we might expect by 2100 
(Nature 577, 618-620; 2020). 
But why 2100? This inordinate 
focus on the century’s 

end, largely derived from 
Intergovernmental Panel on 
Climate Change scenarios, has 
coloured much of the literature 
for years and now saturates the 
public debate. 

Take, for instance, the 
authors’ tags for warming 
above pre-industrial levels: 
1.5°C, “mitigation required 
to reach Paris goals”; 2.5 °C, 
“modest mitigation”; 3 °C, 
“weak mitigation (likely)”; 4 °C, 
“average no policy (unlikely)”; 
5 °C, “worst-case no policy 
(highly unlikely)”. Peak warming 
will post-date peak emissions 
and, depending on feedbacks, 
the planet will still be warming 
in 2100 — even insome of the 
“likely” pathways and certainly 
inthe “unlikely” and “highly 
unlikely” ones. 

Let’s move the discussion to 
peak impact anda full-recovery 
timescale, especially when 
considering policy. 


Paul N. Pearson Cardiff University, 
UK. 
pearsonp@cardiff.ac.uk 
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Whenit rains, lava pours 


Michael Manga 


Early 2018 saw unusually heavy rainfall in Hawaii. Modelling 
now suggests that groundwater pressure increased owing to 
rainfall: this might have triggered changes in the eruption of 
the island’s Kilauea volcano. See p.491 


The most recent eruption of Kilauea volcano 
on the island of Hawaii began in 1983. For 
35 years, most of its magma emerged froma 
set of fissures in the volcano called the upper 
east rift zone. But on 3 May 2018, Kilauea’s 
lower east rift zone opened up, giving way to 
a massive outpouring of lava that devastated 
the southeastern part of the island’ (Fig. 1). 
An important question is why this change 
occurred in May 2018, rather than earlier or 
later inthe course of the eruption. On page 491, 
Farquharson and Amelung? propose that 
record-breaking levels of rainfall in early 2018 
increased groundwater pressures which, in 
turn, madeit easier for rock to break and hence 
magma to rise to the surface at new locations. 

The creation of a pathway that brings 
magma to Earth’s surface begins with the 
mechanical failure of rocks. This failure can 
occur in two ways: new cracks can open, or 
existing faults can slip. Both processes can 
be promoted by pressure changes in ground- 
water. For the former, increases in fluid pres- 
sure decrease the amount of stress needed to 
opennewcracks. For the latter, faults can slip 
when the stresses acting parallel to the fault 
(shear stresses) overcome those perpendicu- 
lar to the fault (normal stresses). These normal 
stresses act to clamp the fault shut. Increas- 
ing fluid pressure in rocks lowers normal 
stresses without changing shear stresses, thus 
promoting fault failure. 

Heavy rainfall increases water levels 
underground and thus pressure in ground- 
water. The volcanic rocks in Hawaii are very 
permeable, which allows water to infiltrate and 
pressure changes to propagate to a depth of 
several kilometres, close to where magma is 
stored. Fluid-pressure changes take time to 
propagate from the surface to those depths. 
Thus, downward migration of rock failure over 
time, along with a time lag between the accu- 
mulation of water at the surface and failure at 


depth’, would be key indicators that rainfall 
was the cause of rock failure at Kilauea. 

Farquharson and Amelung modelled 
pressure changes at Kilauea caused by rainfall 
in the months leading up to the eruption on 
3 May 2018. Their model showed an increase 
in pressure of tens to hundreds of pascals at 
depths of several kilometres. On the basis of 
these changes, along with four sets of observa- 
tions indicating that eruptions at Kilauea are 
associated with patterns of substantial rainfall, 
the authors propose that heavy rainfall pro- 
moted the rock failure that enabled magma 
to flow into the lower east rift zone. 

Is their hypothesis plausible? The pressure 
changes computed by their models are small 
— smaller than stresses from tides. However, 
if rocks are already close to breaking, such 
changes might be sufficient to initiate fail- 
ure. The 2018 eruption was accompanied by 


a magnitude-6.9 earthquake, and examples 
of earthquakes caused by pressure changes 
on this scale are abundant’. For example, the 
widespread increase in earthquake frequency 
inthe central and eastern United States inthe 
past decade results from wastewater injection 
into permeable rocks that increases water 
pressure and changes stresses°. 

The geological record also confirms that 
changes in stresses at Earth’s surface can 
modulate volcanic activity. On land, volcanism 
is promoted by the retreat of glaciers®. Sea- 
level changes between glacial and interglacial 
periods can modulate eruption rates at mid- 
ocean ridges’. Stresses from large earthquakes 
increase the probability of volcanic eruptions® 
and can change activity at volcanoes that are 
already active’. 

Although it is well established that changes 
in water pressure promote earthquakes, they 
are not necessarily a direct cause of magma 
eruption. To begin moving through Earth’s 
crust, magma must create large enough 
stresses in the surrounding rocks to opena 
pathway. Earthquakes triggered in the crust 
around that stored magma, however, can actu- 
ally relieve stress — as such, they might make it 
more difficult for magma to erupt”. 

Ultimately, whether fault failure from 
water-pressure changes can occur close 
to stored magma, as hypothesized by 
Farquharson and Amelung, remains uncertain. 
The first magma to erupt from the lower east 
rift zone in 2018 was old, perhaps left over from 
an earlier, 1955 eruption”, implying that the rift 
zone was already hot. Asa result, groundwater 
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Figure 1| Lava from the lower east rift zone of Kilauea volcano. Farquharson and Amelung? propose that 
exceptionally heavy rainfall led to the eruption of magma from this part of the volcano in 2018. 
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in the rift zone might have been vapour at 
shallow depths”, and at greater depths it could 
have been a supercritical fluid (a substance 
that is notina distinct liquid or gas phase, but 
has properties of both). The high compressi- 
bility of both vapours and supercritical fluids 
would dampen the magnitude of pressure 
changes inthe authors’ model, making failure 
less probable. 

How, then, can we test the hypothesis 
that rainfall initiated the lower east rift zone 
eruption? Unfortunately, subsurface pressure 
measurements — and hydrogeological data 
more generally — are rarely part of volcano 
monitoring. Instead, as with many geoscience 
and Earth-history questions, we have to look 
back in time using the geological and histori- 
cal record of eruptions. In support of their 
hypothesis, Farquharson and Amelung ana- 
lysed all reported eruptions at Kilauea since 
1790, and showed that the volcano tends to 
erupt at the wettest time of year. 

Should we increase alert levels at volcanoes 
after heavy rainfall? We could ask the same 
question about other stress changes, such as 
those from regional earthquakes. This is an 
open question. These stress changes are small, 
and hence, if anything, modulate the exact 
timing of the surface eruption. At Kilauea, 
there were other sources of stress — in fact, a 
change in eruption behaviour had been antici- 
pated on the basis of ground-deformation 
measurements and inferred magma move- 
ment. The Hawaiian Volcano Observatory 
issued a warning on 17 April that anew vent 
might open!. 

The possibility that external processes 
initiate volcanic eruptions is a reminder that 
volcanoes are part of adynamic Earth system. 
Volcanic eruptions influence all surface envi- 
ronments, including climate and weather”. 
Changes in those surface environments, 
such as heavy rainfall, might also influence 
eruptions. We are only just beginning to 
understand these interactions. 


Michael Manga is in the Department of Earth 
and Planetary Science, University of California, 
Berkeley, Berkeley, California 94720-4767, USA. 
e-mail: manga@seismo.berkeley.edu 
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Permanent electric 
control of spin current 


Stefano Gariglio 


The development of low-power methods for controlling a 
property of electrons known as spin could help to maintain the 
historic rates of progress that are occurring in computational 
power. Just such a method has now been reported. See p.483 


A promising technology for the next 
generation of computers is spintronics, atype 
of electronics that depends on the spin — the 
intrinsic angular momentum — of electrons, 
rather than their charge. However, available 
methods for controlling spin require electric 
currents that are too large for practical appli- 
cations. On page 483, Noél et al.’ report an 
approach that allows low-power spin control 
using an electric field. 

The exponential progress in increasing 
computational power over the past 50 years 
has been largely driven by the relentless 
miniaturization of the field-effect transistor’, 
the basic component of silicon chips. This 
consistent downscaling was anticipated? in 
1965 by electronic engineer Gordon Moore, 
and has led to the staggering 2 billion tran- 
sistors that are now typically found in the 
processors of modern personal computers. 
The semiconductor industry has come up with 
aroad map outlining the technological devel- 
opments in computer materials, devices and 
systems that will be needed to maintain these 
historic rates of increase in computational 
power (https://irds.ieee.org). 

Agrowing section of the road map addresses 
a pressing problem for the field: transistors 
based on currently used technology cannot 
be scaled down much further, because the 
physical limits of miniaturization will soon 
be reached. There are no known solutions for 
several of the technical and materials issues 
associated with this problem. Materials sci- 
entists, physicists and engineers are therefore 
investigating an array of potential new work- 
ing principles for computer technology. The 
development of new approaches also allows 
other goals to be targeted, such as lowering 
energy consumption, or incorporating multi- 
ple functionalities into components to speed 
up data processing. 

One way of reducing power consumption 
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would betoeliminate the need for acontinuous 
power supply to maintain the logic state 
(ON or OFF) of transistors. This can be 
achieved using ferroic materials (such as 
ferroelectric compounds, which have a per- 
manentelectric polarization) or piezoelectric 
mechanical devices, which require power to 
switch between the logic states, but not to 
retain those states*. Spintronics technology 
has also seen a surge of interest, because this 
approach is expected to reduce electrical 
dissipation’ — wasteful loss of electrical power 
as heat. Combinations of ferroic approaches 
with spintronics® could be particularly effec- 
tive in the race to develop more-efficient 
computing technology. 

However, many of these approaches will 
require new materials — for example, the semi- 
conductors used in conventional electronic 
devices donot have ferroic properties. A fam- 
ily of compounds known as complex oxides 
are of particular interest, because they host 
permanent electric and magnetic dipoles, 
thereby opening the door to applications that 
require permanent states. Although complex 
oxides are not as good as semiconductors for 
use in classical transistors because they pro- 
duce more electrical dissipation, they have 
remarkable properties for spintronics’. 

Interesting electronic phases have been 
observed to form at the interfaces between 
two complex oxides. Noél and co-workers 
focus ona phase called an electron gas: an 
ultrathin (a few nanometres thick) layer of 
conducting electrons that forms at the sur- 
face of strontium titanate (STO) that has been 
covered by a layer of aluminium. 

STO probably provides the best illustration 
of the complexity of the electrical properties 
of transition-metal oxides. In its pure form, it 
isa dielectric material (an electrical insulator) 
that has atendency to become ferroelectric at 
temperatures below 4 kelvin, but fails to doso 
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Figure 1| A ‘ferroelectric-like’ spin-orbit transistor. A type of electronics knownas spintronics involves 
controlling the spin (the intrinsic angular momentum) of electrons. Noél et al.'injected a spin current 
(arrows in circles indicate electron spins) from a magnetic nickel-iron alloy into strontium titanate (STO). 
The STO was covered by a thin layer of aluminium, which induces the formation of an electron gas (a layer 
of highly mobile electrons) at the STO surface. The electrons in the gas exhibit spin-orbit coupling — their 
spins couple to their momenta. This effect converts the spin current into a conventional charge current 
(red circles indicate electron charges). The authors applied a voltage across the insulating STO beneath the 
electron gas to change the sign of the spin-orbit coupling, and hence the direction of the charge current. 
The insulating STO exhibits surprising ‘ferroelectric-like’ behaviour: it has an overall electrical polarization 
whose direction (large arrow) depends on the applied voltage. The polarization remains in the absence of an 
electric field; this allows permanent control of the spin-orbit coupling and thereby of the direction of the 


charge current. 


because of quantum fluctuations®. However, 
tweaks to the chemistry of STO (such as the 
replacement of some of the strontium atoms 
by calcium atoms) can push the compound 
over the edge to become truly ferroelectric’. 
Andthereplacement of some of the strontium 
atoms by lanthanum atoms increases the 
number of electrons in STO through a pro- 
cess knownas electron doping, and turns the 
material into a metallic conductor, and even 
into asuperconductor™. 

One consequence of the electrons at the 
aluminium/STO interface becoming con- 
fined ina gas is that their spin is coupled to 
their momentum, a phenomenon called 
spin-orbit interaction”. Workers from the 
same research group as Noél et al. have pre- 
viously demonstrated” such spin-momen- 
tum entanglement: when they injected a 
spin-polarized current (a flow of spins that are 
oriented in one direction) into the electron 
gas, they observed a conventional electric 
current (a charge current) whose direction 
depends on the spin orientation and on the 
spin-orbit coupling. This is the result of 
the spin polarization being converted into 
electron motion by the spin-orbit interaction. 

Noél et al. now report surprising observa- 
tions of the STO electron-gas system that adds 
to this complex behaviour. When the authors 
applied an electric field to the STO to control 
spin-orbit coupling in the gas, they observeda 
hysteresis effect — the direction of the charge 
current produced in the gas ‘remembers’ the 


polarity of the applied electric field, even 
after the field is removed (Fig. 1). Moreover, 
when they characterized the properties of the 
insulating STO beneath the gas, they observed 
features commonly attributed to ferroelectric 
compounds: when the polarity of the voltage 
applied across the STO is reversed, a spike of 
charge current is produced. Such a phenom- 
enonis commonly associated with the reversal 
of electric dipoles in ferroelectric materials, 
and is at the core of the definition of electri- 
cal polarization in the modern theory of 
ferroelectricity”. 

The authors’ system has potential 
applications for spintronics, because it acts 
as a spin detector, analogous to optical polar- 
izers that transmit light polarized only along 
a particular direction. Moreover, when the 
polarity of the applied voltage is inverted, 
the selectivity of the spin filter changes so 
that electrons with ‘spin up’ polarization move 
right, instead of left. Crucially, this selectivity 
remains in the absence of an applied voltage. 
This minimizes power consumption and opens 
up applications for memory storage. 

Ferroelectric-like behaviour has been 
observed previously in STO (see refs 14-16, 
for example). However, a peculiarity of the 
effect observed by Noél and colleagues is 
that it occurs only when the applied electric 
field exceeds a critical value. This raises con- 
cerns: true ferroelectric materials don’t show 
polarization only at high fields; it is an intrin- 
sic state that occurs even in the absence of an 
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external electric field. Noél and colleagues’ 
data indicate that something similar to a 
‘relaxor’ state occurs in their system at low 
temperatures, in which a fraction of the STO 
consists of nanometre-scale domains that 
have an electrical polarization, and move or 
reorient in an applied electric field. By con- 
trast, all of the material in a true ferroelectric 
compound is polarized. 

One can speculate that the movement of 
polar walls” — boundaries that form between 
two STO domains that have different crystallo- 
graphic orientations’ — produces the spikes 
of current observed by Noél and co-workers. 
But other microscopic mechanisms might be 
at play, given the richness of STO’s electronic 
behaviour; point defects produced in STO dur- 
ing the fabrication of the authors’ device could 
also havea role. Research into the domain walls 
inSTOis currently booming, and will probably 
find an explanation for the observed behav- 
iour. In the meantime, the authors’ demon- 
stration of a permanent switch of spin-orbit 
coupling at a complex-oxide interface shows 
the potential of this class of material to com- 
pete inthe race for more-efficient computing. 
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The pace of biodiversity 
change in a warming world 


Jennifer M. Sunday 


The timing of disruptions to biodiversity associated with 
global warming is a key, but little-explored, dimension of 
change. Will losses in biodiversity occur all at once, or be 


spread out over time? See p.496 


Projections of the effects of climate change on 
multiple species are often made by estimating 
the change predicted for a single future time 
point; for example, by asking how the geo- 
graphical distributions of multiple species will 
differ in 2100 from those today’. However, this 
approach does not capture the pace, timing or 
possible synchrony of biodiversity changes 
across time. Acute synchronous impacts can 
potentially be more damaging to a system 
than those spread over time, in terms of both 
human adaptation to biodiversity losses and 
ecosystem resilience. On page 496, Trisos 
et al.” report an approach for predicting how 
climate change will affect future biodiversity 
patterns. 

The authors estimated the timing and 
synchrony of climate impacts on organisms 
globally by asking when species in a given 
region willbe exposed to temperatures outside 
their normal global experience (by consid- 
ering projected future temperatures due to 
climate change). They did this by compiling 
geographical-range maps for approximately 
30,000 species, including birds, mammals, 
reptiles, amphibians, fishes, marine inver- 
tebrates, corals and seagrasses, and using 
temperature-projection models to identify 
the warmest average annual temperature 
experienced between 1850 and 2005 by each 
species within its range. Dividing Earth into 
grid cells of 100 square kilometres and using 
predicted climate information, the authors 
determined when each species would experi- 
ence annual average temperatures above its 
historical annual average, encountered any- 
wherein its range, for an extended period. The 
result provides an estimate of whena species 
will be exposed to unprecedentedly high 
temperatures. 

Trisos and co-workers’ approach builds 
on ‘time of emergence’, a concept used when 
analysing climate change. Time of emergence 
describes the time at which a climate varia- 
ble, suchas temperature, emerges beyond the 
historical values of variation observed for a 
particular location — in other words, when the 
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average value of the measurement of interest 
becomes more extreme than the previously 
encountered natural variability. Trisos et al. 
offer innovation in applying this concept to 
the realm of biodiversity. First, rather than 
considering the variation experienced at just 
one location, they considered the full breadth 
of variation experienced across each species’ 
geographical range, defining an organism 
as being ‘exposed’ ina specific grid cell only 
after it has experienced temperatures above 
its range-wide maximum (and with annual 
temperatures remaining above this value for 
a minimum of five years). Second, because 
the authors considered multiple species in 
an assemblage (the group of species present 


High abruptness 


Species exposure (%) 


Temperature (°C) 
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1980 2010 2040 2070 2100 


ina given grid cell), it was possible to assess 
the relative timing of exposure in a graphical 
format that the authors calla horizon profile 
(Fig. 1). This enables the synchrony in the 
timing of exposure events for the species 
in a region to be quantified and easily 
visualized. 

The authors results predict that the greatest 
levels of exposure will occur at latitudes nearer 
the Equator, and, most notably, that there will 
be high synchrony in the timing of exposure 
between species inthe same grid cell, for grid 
cells both on land and in the ocean. Trisos 
etal. find that most species ina given cell will 
usually become exposed to unprecedentedly 
high temperatures within the same decade. 
If this exposure results in local extinction, it 
suggests the following disturbing scenario. 
We might initially see a small trickle of species 
being lost from an assemblage, but this will be 
followed by an abrupt loss of most species in 
the assemblage within the same decade. 

What mechanism might explain this 
predicted pattern? The abruptness in expo- 
sures predicted by Trisos and colleagues is 
not due to any particular abruptness in the 
timing of climate change itself — although 
similar predictions of abrupt ecological 
change have been based on the additive effects 
of gradual climate change with abrupt natu- 
ral climate variability, including weather’. 
Instead, it seems to be attributable to the 
similarity of the thermal niches occupied 
by the species in each grid cell. Trisos and 
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Figure 1| Exposure of species to unprecedented temperatures owing to climate change. Trisos et al.” 
report a global analysis describing when species (as assessed in grids of 1OO square kilometres) are predicted 
to encounter, for more than five years, higher annual average temperatures than they have previously 
experienced anywhere in their geographical range. This state is called species exposure, and the authors 
assessed more than 30,000 terrestrial and marine species. a, For the majority of locations, the timing of 
when most species ina grid are exposed occurs highly abruptly, as in this example from the Amazon basin. 
Temperatures are from projection models, and the predicted future temperatures due to climate change 

are from the RCP8.5 model, which depicts a scenario of high greenhouse-gas emissions’. b, By contrast, 

in the Gobi Desert (located in northern China and southern Mongolia), species exposure occurs with low 


abruptness. (Graphs based on Fig. 1 of ref. 2.) 
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colleagues find that more than half of the 
species ina given cell (and almost 90% in most 
marine assemblages) tend to have geograph- 
ical ranges that encompass similarly warm 
temperatures, such that they would all face 
exposure at around the same time. 

Such a striking pattern of shared thermal 
niches within assemblages has been observed 
before, ina global analysis of marine fishes and 
invertebrates‘. In that study, species’ thermal 
niches were found not to change gradually 
with latitude, but instead to have distinct 
transition points, indicating that species 
belong to what are termed thermal guilds*. 
These shared thermal niches could be due to 
physical boundaries or ecological interactions 
that restrict the ranges — and temperatures 
experienced — of multiple species similarly. 
Or this phenomenon might be the result of a 
low rate of evolution in the range of tempera- 
tures across which the species can fundamen- 
tally persist, leading to the maintenance of 
thermal guilds. 

When does this abrupt exposure happen? 
It is predicted that it will occur at different 
times for grid cells around the world, from 
some predicted to be occurring already in the 
ocean, to others occurring towards the end 
of the projected time range, in 2100. That the 
timing is different across grid cells is a good 
thing, because at least all of the assemblages 
aren't predicted to experience abrupt losses 
at the same time. But, notably, the timing of 
exposure does not correlate with the timing 
of climate-change emergence in temperature, 
suggesting that the latter metric might bea 
poor predictor of major biodiversity change 
within a given grid cell. 

Trying to project the timing of biodiversity 
shifts is anoble objective that will surely help 
us to develop management systems and antic- 
ipate crises. Although Trisos et al. provide an 
initial approach that offers useful insights, 
further studies should attempt to validate 
and qualify these predictions. For example, 
Trisos and colleagues used temperatures out- 
side species’ current thermal niches to define 
climate exposure, but we don’t know what will 
really occur when species experience such 
temperatures — many can certainly tolerate 
temperatures beyond those found in their 
current ranges**. The timing of exposure to 
truly limiting environments might turn out 
to be more diverse across species than cur- 
rently predicted by Trisos et al. if variation 
in species’ fundamental climatic niches (the 
range of temperatures and other climate var- 
iables across which an organism can survive) 
is considered. It will also be useful to consider 
the flip side of the range-shift issue: the timing 
and abruptness with which new species enter 
an assemblage as a result of range extensions 
arising from climate change. 

Most crucially, as climate change pro- 
gresses, we should be able to test and refine 


projections such as these using real-time 
observations. Where are biodiversity changes 
already occurring abruptly? The need for 
systematic global biodiversity monitoring 
has never been stronger. 
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Can't see the wood 


for the trees 


Mark Pagel 


Evolutionary-tree diagrams, which show the branching 
relationships between species, are widely used to estimate 
the rates at which new species arise and existing ones become 
extinct. New work casts doubt on this approach. See p.502 


Scientists often want to make inferences about 
what the biological past was like, and how that 
past gave rise to the present, because doing 
so allows them to understand the processes 
that drive evolution. But on page 502, Louca 
and Pennell’ challenge a major aspect of that 
enterprise. 

Specifically, their work regards the issue 
of estimating past rates of speciation and 
extinction, which are, respectively, the rates 
at which new species arise and existing species 
go extinct. These rates determine the num- 
ber of contemporary species of various forms. 
Thereare, for instance, around 6,600 species 


“Assumptions are being 
made about the things 
that we would like to 
estimate.” 


of songbird (passerines), which constitute 
more than half of all existing bird species, 
and we might therefore be tempted to say 
that songbirds have a high rate of speciation 
in comparison with that of other birds. But it’s 
also possible to speculate that they have alow 
extinction rate. Louca and Pennell show that 
the uncertainty is even worse than this: not 
only can we not estimate these two rates, but 
also there is aninfinite number of different sets 
of these two parameters that are equally good 
at describing any particular outcome, such 
as the number of species of contemporary 
songbird. 

Because fossils are scarce or non-existent 
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for the vast majority of species, evolutionary 
scientists instead estimate speciation and 
extinction rates from phylogenies — tree 
diagrams that describe the patterns of 
descent among a group of contemporary 
species (Fig. 1a,b). For any such phylogeny, 
it is easy to construct what is termed a 
lineage-through-time plot; this records the 
cumulative number of lineages up to that point 
in time on the tree that will eventually leave 
one or more living descendent species (Fig. Ic). 
The slope of the curve fitted to such a plot, 
often denoted by A, is the net speciation rate. 
This is equal to the difference between the 
rate of speciation, termed b (or birth), and 
the rate of extinction, termed d (or death). It 
is described by the equationA=b - d. 
However, it is known that a difficulty 
arises in estimating b and d, because if all 
that is available is the number of species 
that have survived to the present, such as 
our 6,600 songbirds, any pair of b and d that 
returns the same value of A will produce an 
identical lineage-through-time curve, and 
there is an infinite number of these pairs. In 
fact, it turns out that for the simple case of 
estimating b - d, suchas described here, a fea- 
ture of the shape of the lineage-through-time 
curve can be exploited to estimate the rate of 
extinction, and then the rate of speciation 
can be found by subtraction’. But to do so 
requires making the assumption that both of 
these rates are constant throughout the entire 
time span of the tree, when instead they almost 
certainly vary between the different branches 
(lineages) of the phylogeny, and throughtime. 
This is where Louca and Pennell step in, 


Nature | Vol 580 | 23 April 2020 | 461 


News & views 


@ 


Speciation 


x Extinction 


Past 


Figure 1| Assessing evolutionary histories. Louca and Pennell’ raise 
questions about a standard approach to estimating past rates of 

species formation (speciation) and extinction that uses data froma 
lineage-through-time plot. The number of species in the present depends 

on how speciation and extinction rates varied over time in the past. Using 
mathematical modelling, the authors reveal that an infinite number of pairs 
of speciation and extinction rates could give rise to any given outcome, and 
it is thus unclear how to determine the correct rates. a, b, Examples of known 
extinctions are rare, and are shown in these hypothetical tree diagrams only 


because the novelty and mathematical 
sophistication of their work lie in showing 
that we cannot estimate these ‘time-varying’ 
speciation and extinction rates. The authors 
invoke earlier work? that defines the existence 
of atree’s ‘deterministic’ lineage-through-time 
curve: this is a set of differential equations 
(equations describing rates of change) that 
fully determine the number of lineages ina 
tree at any given time. Louca and Pennell’s key 
result is then to show that there is an infinite 
number of alternative sets of time-varying 
speciation-extinction rates that yield the 
same number of lineages at any given time as 
does the deterministic lineage-through-time 
curve. They further show that the most prob- 
able estimates of the two rates (calculated 
by maximum-likelihood methods) do not 
necessarily identify the correct underlying 
model — as demonstrated by an analysis 
of hypothetical cases for which the true 
time-varying speciation—extinction rates 
are known. 

Even worse for those who want to use 
the rates of speciation and extinction to 
study evolution, the possible alternative 
scenarios of time-varying speciation and 
extinction rates that are consistent with the 
deterministic lineage-through-time model 
often differ qualitatively. For example, the 
authors show that a phylogeny of approx- 
imately 80,000 species of seed plant is 
equally well described by speciation and 
extinction rates that both gradually increase 
through time or that both gradually decrease 
throughtime. Other scenarios, including rates 
that vary wildly with time, provide equally 
good descriptions of the numbers of lineages 
throughtimeas derived from the deterministic 
lineage-through-time model. 

Louca and Pennell’s conclusions will be 
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dispiriting to evolutionary scientists who are 
looking for a link between past levels of spe- 
ciation and extinction and historical climate 
change or other environmental events, or 
who want to test ideas about what features 
of aspecies — suchas diet, mating system or 
the length of a generation — might be used to 
predict speciation and extinction rates*. The 
limitations that Louca and Pennell have iden- 
tified for estimating speciation and extinction 
rates do not go away as the size of the phylo- 
genetic tree increases. Nor do other common 
features of trees provide much help: for exam- 
ple, ifa group of species has never suffered any 
extinctions, estimating their speciation rate 
would be straightforward. But this is rare, and 
unlikely to be known in advance. Having abun- 
dant fossils could help, because they provide 
evidence needed to estimate extinction rates; 
however, fossils are seldom abundant. We can 
make assumptions about how speciation and 
extinction might vary with each other, through 
time, or withthe number of species, but these 
assumptions are being made about the things 
that we would like to estimate. 

Amid this epistemological carnage regard- 
ing what we can possibly know, the authors 
helpfully offer some consolation by showing 
that it is possible to estimate a parameter 
they call the pulled speciation rate, or A,. This 
measures the rate of change (the slope of 
the curve) of the deterministic model of the 
lineage-through-time plot. The pulled speci- 
ation rate can be compared between lineages, 
or at different times, and might be useful for 
understanding the processes that gave rise 
to the species that are alive today, even if not 
necessarily providing information about those 
species that didn’t make it. 

And this aspect — the ones that became 
extinct — is the deeper lesson of Louca and 
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to illustrate how different rates of extinction (and different speciation rates) 
can yield the same lineage-through-time plot. c, Information taken froma 
tree diagram can be represented in a lineage-through-time plot as shown. 
Red dots indicate the number of lineages at a given time that gave rise to 
lineages existing in the present. The slope of the curve equals the speciation 
rate minus the extinction rate. This plot is valid for both trees even though 
they have different speciation and extinction rates. This underscores the 
authors’ demonstration that many different data inputs can give identical 
lineage-through-time plots. 


Pennell’s work. Without fossils, all evolutionary 
scientists, whether studying speciation and 
extinction or attempting to reconstruct 
the features of distant ancestors, need to 
be aware that the evolutionary processes 
they identify are those that operated in the 
species that would survive and eventually leave 
descendants in the present. We can’t be sure 
what was going on in those that went extinct. 
It is the evolutionary version of the observa- 
tion that history is written by the victors. The 
supreme irony of this predicament is that 
Charles Darwin’s idea about the survival of the 
fittest, the story that we want to understand, by 
its very nature renders elusive some of the key 
components needed to study it. 
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® Check for updates 


Following its flyby and first imaging of the Pluto—Charon binary, the New Horizons 
spacecraft visited the Kuiper belt object (KBO) 2014 MU,, (also known as (486958) 
Arrokoth). The imaging showed MU,, to bea contact binary that rotates at alow spin 
period (15.92 hours), is made of two individual lobes connected by anarrow neck and 
has a high obliquity (about 98 degrees)’, properties that are similar to those of other 


KBO contact binaries inferred through photometric observations”. However, all 
scenarios suggested so far for the origins of such configurations*> have failed to 
reproduce these properties and their probable frequent occurrence in the Kuiper 
belt. Here we show that semi-secular perturbations®’ operating on only ultrawide 
KBO binaries close to their stability limit can robustly lead to gentle, slow binary 
mergers at arbitrarily high obliquities but low rotational velocities, reproducing the 
characteristics of MU, and other similar oblique contact binaries. Using N-body 
simulations, we find that approximately 15 per cent of all ultrawide binaries witha 
cosine-uniform inclination distribution>” are likely to merge through this process. 
Moreover, we find that such mergers are sufficiently gentle to deform the shape of the 
KBO only slightly. The semi-secular contact binary formation channel not only 
explains the observed properties of MU, but may also apply to other Kuiper belt or 
asteroid belt binaries and in the Solar System and extra-solar moon systems. 


The discovery of the bilobate shape of MU,, and its peculiar configura- 
tion provided new clues and opened avenues of exploration into the 
physical processes that sculpt the Solar System. Here we describe an 
evolutionary channel for the formation of MU,, from an initially wide 
binary. We consider the initial binary to bea member of a hierarchical 
triple together with the Sun. Owing to secular evolution induced by 
the Sun, the inner orbit may experience changes in its eccentricity (e) 
and mutual inclination (i) on secular timescales much longer than the 
orbital period, known as Lidov-Kozai (LK) oscillations, which can be 
modelled using a secular orbit-averaging approach”. Large LK oscil- 
lations take place when the mutual inclination is large (40° si< 140°). 
The highest eccentricities are attained as the binary evolves to the 
lowest inclinations and vice versa”. 

If the eccentricity of the binary exceeds a threshold e,,, the small 
pericentre allows binary collisions. Thus, LK evolution could lead to 
coalescence of individual Kuiper belt binary (KBB) members into a 
single, probably irregularly shaped, KBO°. However, because the closest 
approach occurs concurrently with the lowest inclinations, collisions 
mostly occur near i~40° and i~140° (ref. °). Moreover, tidal effects and 
the non-spherical structure of KBB components quench LK evolution, 
which makes collision possible only in a small part of the parameter 
space*™*, The standard LK mechanism is therefore disfavoured for the 
origin of the highly oblique MU,,, but can explain the origin of highly 
eccentric KBBs such as WW31 and 2001 QW322*°, 

For larger ratios of the inner period to outer period, secular averag- 
ing breaks down and the evolution becomes semi-secular. The orbit of 


the inner binary now evolves considerably on timescales of the outer 
orbit, and short-term fluctuations arise, making the LK evolution more 
complex®”””, The maximal eccentricity can be calculated analytically, 
including in domains where it is unconstrained’ and the evolution is 
non-secular. Figure 1a shows the analytical two-dimensional parameter 
space for allowed and forbidden domains for collisions in terms of the 
initial inclination cosi,. The initial separation of the inner binary is 
normalized to the Hill radius, r4y= oy;(,/3M.)”, where a,,, is the outer 
semi-major axis, M, is the mass of the Sun and m,, is the mass of the inner 
binary. For an inner semi-major axis a, the (dimensionless) separation 
a=a/r,cannot exceed the Hill stability limit for highly inclined orbits®, 
a, = 0.4. We use the outer orbit parameters of MUgo: doy, = 44.581 AU 
and eccentricity e,,,, = 0.041. We model the lobes as triaxial ellipsoids 
of dimensions approximately 22 x 20 x 7 and 14 x 14 x 10 km’ (ref. '), 
leading to a total radius R,,, = 18 km and inner mass m,, = (1.61 + 1.03) 
108 g = 2.64 x 10" g for a density of p = 1g cm’? (see Methods for 
other densities). Secular collisions occur only for sufficiently large 
critical inclination and beyond a certain initial separation a,,,, which 
overcomes LK quenching (see equation (12) and Methods). Non-secular 
collisions will dominate over secular collisions beyond a transitional 
separation a,: 
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Fig.1|Roadmap to collisions of MU,,. a, The orbital evolution of MU,, in initial 
separation-initial inclination space. The initial eccentricity, e, is O (blue), 0.4 
(green) and 0.8 (red). Solid lines show the condition for non-secular collisions 
(equation (14)) with unbound eccentricity. Dashed lines show the conditions 
for asecular collision (equation (4)) with deterministic eccentricity. The 
different domains are as follows: white, LK oscillations are completely 
quenched andthe eccentricity is constant, a< a, (equation (9)); green, the 
eccentricity is excited but belowe,,,, and collisions are avoided, a, <@< Qo 
(equation (12)); grey, secular evolution can lead to acollision, a. <a< a, 
(equation (1)); blue, non-secular perturbations dominate and lead to a collision; 
red, the initial inclination is too lowto induce a collision. b—-d, Time evolution of 
the instantaneous distance, inclination and eccentricity of an individual orbit 
with initial Keplerian elements: semi-major axis a=0.3r,, eccentricity e=0.1, 
inclination i= 86°, argument of periapse w =O, argument of ascending node 


OQ=t/4 and mean anomaly = 0. The outer binary is set at @,,,=Qo4,.=0 and 
Mout =~ 11/4. 


In our case, a, ~ 0.174. Figure 1b demonstrates the separation in the 
non-secular regime before the collision. During the high-eccentricity 
phase, there are about 10 cycles where the instantaneous separation 
drops below 10? km. A collision occurs during the third LK cycle after 
about 4,600 yr. The mutual inclination flips its orientation during the 
high-eccentricity peak of the LK cycle (Fig. 1c). The eccentricity is essen- 
tially unbound and a collision eventually occurs (Fig. 1d). 

To explore in detail the overall evolution and statistics of KBBs in 
the chaotic non-secular regime, we defer to detailed N-body simu- 
lations, which provide us with the probability for collisions and the 
post-collision characteristics. We use the publicly available code 
REBOUND ® with the IAS15”’ integrator (see Methods for details and 
stopping conditions). We integrate four sets of initial conditions in 
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the non-secular regime. The first three sets have initial separations of 
a=0.2, 0.3 and 0.4, and the fourth set has uniformly sampled separa- 
tions in a=[0.2, 0.4]. The orbital angles are sampled uniformly. The 
mutual inclination of observed binaries is cosine-uniform’, and thus 
we follow cosine-uniform sampling with a cut-off at |cosi| < 0.4 (lower 
inclinations cannot lead toacollision). For each case, werun 250 simu- 
lations (except for a= 0.2, for which we run 200 simulations and use 
|cosi| < 0.3), each up to5 x 10* yr. 

Figure 2 shows the cumulative distribution function of various 
parameters of the colliding orbits. Both the closest-approach dis- 
tance g = a(1- e)/R,,, (Fig. 2a) and the final inclinations (Fig. 2c) at 
collision are consistent with a uniform distribution (in cosi) between 
40° and 140°, suggesting that the orbits are indeed chaotic and in the 
non-secular regime, as expected. Most orbits induce collisions after 
about a few thousand years (Fig. 2b). The mean collision time increases 
with increasing separation. The velocity at impact is comparable to the 
escape velocity with a very small dispersion, consistent with a gentle 
collision”® (Fig. 2d). 

We find the overall merger fractions of wide binaries to be around 
12%-18% (see Extended Data Table 1), whichis roughly consistent with 
the observed 10%-25% occurrence of contact binaries for the cold 
classical belt”. Most mergers occur for initially high inclinations, as 
expected. About 1%-3% of all wide binaries produce highly oblique 
contact binaries (i= 80°-100°), consistent with the observed high 
obliquity of MU,,. and providing predictions that can be verified by 
future KBO observations. There is little dependence on the underlying 
distribution of a, and merger rates are bounded between minimal and 
maximal values of 12% (for a= 0.2) and 18% (for a=0.4). Moreover, ina 
collisional environment” the binary orbits can be perturbed such that 
originally low-inclination orbits become highly inclined and become 
subject to semi-secular evolution, forming contact binaries; the quoted 
formation rates are thus lower limits to the total fraction of contact 
binaries formed through this process. 

Thenon-merging systems continue to evolve quasi-periodically. On 
longer timescales, three-body encounters are expected to shape the 
populations of KBBs*”. Exchange interactions can drive the binaries 
into equal masses”, and the loose nature of the binaries can result in 
evaporation (Heggie’s law)”. There are only a handful of KBBs beyond 
az0.05r, with either prograde or retrograde orbits that are not highly 
inclined (see figure 1 of ref. *°), whereas the widest known binary, 2001 
QW;,,, With a = 0.21, is expected to disrupt within a billion years". 

To test the feasibility of the semi-secular collision origin of MU,, 
we also need to account for the observed spin period of MU,,. Angular 
momentum conservation enables us to find the resulting spin period 
depending on the impact angle and the primordial spins of each com- 
ponent. The final impact parameter at collision (which corresponds 
to an impact angle; see Methods) is uniformly distributed, and thus 
our model can robustly produce a wide range of possible final rotation 
periods, without any fine-tuned modelling of the composition and 
density of MU,o, thus also alleviating the angular momentum problem 
of other models'. 

Figure 3a shows the outcome of acollision at a 40° impact angle with 
high-material-strength composition, which reproduces the shape of 
MU,,. Low- or medium-strength materials result ina deformed shape 
and are thus ruled out. If the density of MU, is halved compared to 
the fiducial 1 g cm™ value (as suggested by ref. '), the escape veloc- 
ity V.-—at which typical collisions occur—is lower, and thus using 
medium-strength-material parameters also produces an undeformed 
shape. Random collisions—even at relative velocities as low as 10v,,.— 
destroy or heavily deform binaries with high-strength-material com- 
position; they are likewise ruled out (see Methods). Figure 3b shows 
the expected spin-period dependence onthe impact angle. Animpact 
angle of about 40° reproduces the observed spin period (see Methods) 
for initially non-spinning objects. Taking a typical initial spin period 
of about 10 h with random orientations extends the range of plausible 
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Fig. 2| Cumulative distributions of the impact characteristics. The 
cumulative distribution functions (CDFs) are for a= 0.4 (solid blue), 0.3 (dashed 
red), 0.2 (dotted green) and uniform in (0.2, 0.4) (dash-dotted black). a, 


impact angles to about 20° and 70°, for the maximally aligned and 
anti-aligned configurations, respectively. smoothed particle hydrody- 
namics (SPH) collision simulations agree with our simplified estimate 
and support our assumptions of undeformed, rigid bodies when mod- 
elled with high-strength material parameters, or else medium-high 
strength parameters if the density and impact velocity are slightly 
lower (see ref. ”” and Methods for details). 

Together, our dynamical and post-collisional modelling yields a 
coherent picture for the origin of MU,, from an ultrawide KBO binary. 
Such wide KBB progenitors could be a natural byproduct of KBO and 
KBB evolution in the early Solar System?”®”’, It is most probable that the 
characteristics of MU,, are not unique, and that secular or semi-secular 
evolution plays a major role in the evolution of many KBBs and in the 
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Fig. 3 | Shape and spin period of MU,,. a, Final collision outcome at an impact 
angle of 40° with high-material-strength composition (see Methods). b, Spin 
period asa function of the impact angle for MU,o. The horizontal line is the 
observed period, 15.92 h. The solid blue line indicates initially non-rotating 
progenitors. The dashed red line indicates two initially aligned rotating 
progenitors witha period of 10h, and the dotted green line indicates an 
anti-aligned configuration. The black crosses are the results obtained from 
SPH simulations (see Methods). 


Pericentre q/R,,:. b, Time of collision. c, Final inclination at impact. The shaded 
grey lineis a uniform cumulative distribution in cosiin the range 30°-150°. 
d, Velocity at impact. The vertical black dashed line is the escape velocity, v.,.. 


production of low-velocity collisions between individual KBB com- 
ponents. In fact, modelling of the Pluto-Charon system also suggests 
alow-velocity impact origin*°. Moreover, given the high obliquity of 
the Pluto-Charon system, it is possible that it also originated from an 
initially wide binary and followed a secular or semi-secular evolution, 
similar to MU,,. Similar evolutionary scenarios might also apply to the 
evolution of other contact binaries such as (139775) 2001 QG298 (ref. 7), 
as well as moons and exo-moons, as all of these form hierarchical triple 
systems with their host star. 
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Methods 


Lidov-Kozai secular evolution 

Let us consider the evolution of a binary KBO due to LK secular evolu- 
tion. Let the inner binary start with an initial separation of a,=ar,,eccen- 
tricity e, and mutual inclination ip. The Hill radius is ry= @oy¢(Min/3M). 
The most stable orbit is around a, ~ 0.4 (ref. 8). The minimal eccentricity 
required for collision is 


Reot 


Coon = 4 ah, (2) 
Using the standard LK formula, the maximal eccentricity is 
2 5 2 2. 
Cmax = 1- 3 (1- eo)cos ig (3) 


For a collision occur, we require that e,,,, = Ceo, Which yields the 
critical inclination ig 


6 Reo _ 0.037 ( a ie 
y5Q-e6) 4% — f1—e2 \0.3 


such that collisions occur for |cosio| < |cosig} Here, Rio/fy «1 has been 
expanded to linear order. The probability for collision can be expressed 
interms of integrating over the distribution function/, g(a, 8) and where 
6=cosi. For uniform independent distribution in cosi, as inferred from 
KBO observations”, the probability is 


sens 
cos ig = 


(4) 


Ples)=J”£ (@aadda 5) 


where 0.(q) is given by equation (4). 


Inclination angle at impact. From the conservation of \/1- e?cosi = 
const =/, the inclination at impact is 


{1-e3 coSsig _ er 
coll= 7 —— = y1- €4 COSig | Si, (6) 
f1- Ccoll tot 


To find the impact angle i,,,,, we invert equation (6): 


cosi 


: . {2k 

ig= 2ros{ costa mt (7) 
To find the observed obliquity i,,, = 98°, the critical inclination is at 

least |coSip| < 0.00585, (89.33° < ip < 90.66°) for a= 0.1, whichis unlikely. 

Moreover, for a < 0.1, collisions are unlikely regardless of the initial 

inclination, owing to the effects of oblateness, as shown below. 


Effects of oblateness. Small KBOs might not have spherical shapes, in 
which case their gravitational potential is not spherical. Suchaconfigura- 
tion induces extra precession on the orbit which can considerably affect 
the secular evolution. The leading term is encapsulated ina dimension- 
less parameter /,, which is related to ratio of the axes, or the polar and 
equatorial radii of the bodies”. Planets are mostly spherical, and their 
deviation is small—/, ~10* for Earth and around/, = 0.014 for Jupiter—and 
is related to the flattening of the planets induced by their rotations. In 
the case of the components of MU,s, the objects are highly non-spherical 
and/, could be large. Using the principal moments of inertia of an oblate 
spheroid, we have/,=[1-(c/a)”]/5, whichis around, ~ 0.18 for the primary 
component and/, = 0.1 for the secondary component. 

The additional precession may quench the LK oscillations if it is too 
strong. To quantify the effects of the additional precession we can 


define a dimensionless quantity, ¢,,,, that measures the ratio between 
the LK-induced and the oblateness-induced precessions” 


3 —p2 )3/2p2 
€= ay Min Furl Cour) Rj 
PED ATR (an,)° 


(8) 


Setting €,..= 3/2 leads to the definition of the Laplace radius®™ in 
terms of the Hill radius, r, = a,7,, where 


YS/ 3 y2)1/5 1/5 
Mm, Auk R 
a, = e in ) ( a i) ar a, e2,,)/° 0.03( J; 1 


2/5 
(9) 


leading to” ¢,,, =1.5(a/a,) >. It has been previously shown that the maxi- 
mal eccentricity attained is given by the implicit expression for cosi,=0 
(their equation 50): 


Mout 4 0.2 11km 


é 1 9 
3 lia yr -1)- gem ” 
max: 


Cmax g frot (11) 
Acollision can occur only if €.61 < max OF if 
2/7 3/7 
3 2h 
Xeoll* (5) ae E ze) =0.12 (12) 


Note that a similar analysis can be performed for tidal distortions 
or relativistic corrections”. In this case, they are much weaker than 
the rotational effects. 


Non-secular Lidov—-Kozai evolution 

Inthe previous section we considered the evolution due to secular LK 
evolution. In the semi-secular (semi-LK) regime®’, short-term fluctua- 
tions can substantially change the evolution. In the following we discuss 
the overall effects of such short-term perturbations. The strength of 
the perturbations is encapsulated in the single averaging parameter’: 


: -( a, le Wlaae i. q/2 =o a y" 
a bout \ Min J3(1-e2,,)°/ 0.3 


One important quantity is the (averaged) z angular momentum 
i =,/1- 2 cosig, assuming that i, and e, have their mean value. 

Inthis case, j,is no longer conserved, but its value averaged over the 
outer orbit, i, is conserved. The eccentricity of the orbit becomes 
unbound once the fluctuationin j (Aj,) is larger than its initial value, 
namely Aj, >j,. The fluctuation has been estimated analytically”, and 
can be used to show that the eccentricity is unbound if 


3/2 
cosi, j1-e3 S$ 284 = 0.118 (os) 


where &cq = €sa(1 + 2/2 €ut/3) * 1.039€5, has been defined for con- 
venience. 

The width of the non-secular semi-LK regime increases with a, 
whereas the width of the secular LK regime decreases with increasing 
a. Comparing equation (4) and equation (14) yields the transitional 
separation a, found in equation (1). 


(13) 


(14) 


Spin period 

There is little evidence for structural changes of MU,, since its for- 
mation, and the spin period is believed to be primordial’. In our 
model, the collision is gentle and occurs at relatively low velocities 
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(Vege = 442.4 cm Ss for our nominal density and v,,. = 3.128 cm s™ for 
the lower density of 0.5 g cm assumed in ref.'), such that almost any 
impact parameter (or impact angle) is allowed. Therefore, to obtain the 
observed spin period, we can use the standard arguments of angular 
momentum conservation and derive the impact parameter (or impact 
angle) that yields the desired spin rate. 

Consider triaxial ellipsoidal bodies iwith masses m,and axes a;>b;>¢;, 
with i=1,2.We assume that the major axes a, are parallel, similar to the 
observed object, and that the collision occurs in parallel with the major 
axes. The largest moment of inertia is /? = ma? + b;)/S. 

After the collision the distance between the centre of masses of 
the joint body and each centre of the ellipsoid is r;. Then the principal 
moment of inertia of the joint body is 


IS = 19) +13) + myrt + mgr} = 
i=l 


M (2.72. ,2 
5 (4 +b; +5r?) 

Now, the ellipsoids collide with relative velocity v,,.and impact param- 
eter b. The orbital angular momentum is L, = wbv,,,, where = m,m,/ 


(m,+m.,) isthe reduced mass. If the two bodies are non-rotating, then 
the joint angular frequency is 


L, Sub Vese 


~ tot ~ 
Bie p Mavens iy ph ed 
I; m,( a} +bi+ 5r7) + m,(a3 +b>+ 5r3) 


a (15) 


If the individual bodies are rotating around the z axis with frequen- 
cies O,, the additional angular momentum of each body is 190, for 
i=1,2, and thus equation (15) becomes 


= SHDVesc 3 m,(a? 2 bt), - m,(a? a bt) O, 


2 2 
m,(a? +bi+ Sr?) + m,(a} +b5+ 5r3) 


For animpact angle 6, the distance of the point of contact to the cen- 
tre of each ellipsoid is = /a?cos6 + bjsin’@. The impact parameter 
is related to the impact angle by sinO = b/(€, + €,) = b/d. The distances 


from the centre of mass are r,=m,d/(m, + m,) and r, = m,d/(m, + m,) 
and the spin rate is 


(16) 


5pudv,.. sinO + m,(a? + bi), + m,( a? + bi)O, 
Q= (17) 
Sud? + m,(a? + bi) + m,(a3 + b3) 


Figure 3 shows the spin-period dependence on theimpact angle for 
the typical parameters of MU... The spin period is P= 2m1/Q, where Qis 
given by equation (17), and when there is no internal rotation, Q,=Q,=0. 
We see that an impact angle of about 40° gives the observed spin period. 
We have performed hydrodynamical simulations based on the code of 
ref.” that qualitatively agree with our assumptions and produce similar 
results. Typical classical KBOs could have primordial spin periods® 
with comparable contributions to the angular momentum budget. 
Recently, it was found” that the mean cold classical KBO spin period is 
9.48 + 1.53 h. Generally, there is no reason for the spin vectors of each 
body to be correlated, so on average the contribution is zero. In extreme 
cases, the spin vector of both objects could be aligned or anti-aligned 
with the orbital angular momentum. In these cases, a large range of 
impact angles and spin configurations are possible, resulting in the 
observed spin period after the collision. For a typical period of 10 h, 
the impact angle is about 20° in the aligned case, and about 70° in the 
anti-aligned case. 


N-body stopping conditions and tests 

We impose a stopping condition that the distance between the two 
bodies is less than their mutual radius. During the non-secular highly 
eccentric passage, the change in the pericentre gis much faster than the 


inner orbital period (this is the definition of the non-secular regime), 
and hence the orbital elements are not reliable at this stage. Once the 
simulation stops it records the orbital elements at impact, which we 
use for our statistics, but these are not involved in the stopping con- 
dition. From the output we know the closest approach at impact. We 
have tested the stopping condition by varying it to be slightly smaller 
or larger than the g found in the first run. Indeed, when the stopping 
condition was below g the objects did not collide and the code con- 
tinued running. We thereby concluded that the collision is physical 
and reliable. 

Extended Data Table 1 shows the merger fractions from the simu- 
lations. The total merger fraction f, is the total number of mergers 
divided by the initial number of runs, multiplied by the relative frac- 
tions of the inclination distribution, assuming that no mergers occur 
outside the sampled inclination distribution. The fraction fgo-;99 iS 
calculated in the same way, except that the only mergers considered 
are those where the mutual inclination during the merger is within 
the designated boundaries of 80°-100°. For example, for a = 0.2, the 
merger fraction is 78/200 = 0.39. Multiplied by the range of the inclina- 
tion distribution, f,= 0.39 x 0.3 = 0.12 and the high-obliquity merger 
fraction is fgo-100 = 9/200 x 0.3 = 0.014. For a= 0.3, the merger fraction 
is 99/250 = 0.396. Multiplied by the range of the inclination distribu- 
tion, f;= 0.396 x 0.4 = 0.158 and the high-obliquity merger fraction is 
feo-100 = 12/250 x 0.4 = 0.019. 


Impact modelling 

We perform hydrodynamical collision simulations using our SPH code”, 
which treats self-gravity, gas, fluid, elastic and plastic solid bodies that 
have a material strength, including a porosity and fracture model that 
can be applied for small-body collisions®*””. In order to treat numeri- 
cal rotational instabilities, a tensorial correction scheme’ is imple- 
mented. The miluphCUDA code is implemented with CUDA, and runs 
ongraphics processing units (GPUs), withan improvement of approxi- 
mately one to two orders of magnitude for asingle GPU comparedtoa 
single central processing unit (CPU). The code has previously been 
successfully applied to several studies involving impact pro- 
cesses?°3739-47, 

For the porosity treatment, we implement the P-a model*®”, in which 
the pores are much smaller than the spatial resolution and cannot be 
modelled explicitly. Here, the total change in the volume depends both 
onthe compaction or collapse of the pore space and on the compres- 
sion of the solid material that constitutes the matrix. The dependence 
is expressed in terms of a porous material pressure P and density @ as 
P/g=P,/o,, where P, and g. are the pressure and density of the solid 
matrix material, respectively. The distention parameter @ = @,/eisthe 
ratio between the solid matrix material and the porous material densi- 
ties, and relates to the porosity p via y =1-1/a. For the solid matrix 
material we use the Tillotson equation of state (EOS) parameters” with 
a reduced bulk modulus of A = 2.67 x 108 Pa (the leading term in the 
EOS) to take into account the smaller elastic wave speeds in porous 
materials compared to solid materials, consistent with previous work™, 
Our matrix density is chosento be 2g cm, about the same as that used 
previously”, which leads to a 50% porosity for our fiducial bulk density 
for MU, ,1g¢.cm™®. The matrix density and the initial porosity are both 
in rough agreement with what might be expected from an object of 
this origin and size range. In particular, the former constrains the rock- 
ice mass ratio to be about 3-4 (depending on the exact choice of silicate 
grain density), which could be compatible with this type of KBO*?*°. 
However, we note that given the uncertainties involved, we seek only 
to obtain a rough estimate of the density that will permit us to test our 
working hypothesis. We then also run simulations with 75% porosity 
and half the previous bulk density to establish the qualitative differ- 
ences between these two setups. 

For collisions between small porous bodies, compressibility is limited 
bythecrushcurve for @ for typical pressures, instead of by the Tillotson 


EOS parameters. We thereby choose three sets of crush-curve param- 
eters”, using a simple quadratic crush curve”: 


(R-P)? 


eS” pe py 


(18) 


where @ = 2, P. is the transition pressure between the elastic and plas- 
tic regimes and P, is the pressure of full compaction. Both P. and P, are 
listed in Extended Data Table 2. As ref. * treats comet 67P/Churyumov- 
Gerasimenko, which belongs to a class of much smaller and active 
objects, we assume MU,, is probably fluffier and more porous. Hence, 
our low-strength crush-curve values correspond to the previous 
high-strength values”, and taking the same modelling approach we 
then increment the parameters in each subsequent model by one order 
of magnitude. 

Fracture and brittle failure are treated using the Grady and Kipp 
fragmentation prescription ©, which is based on randomly dis- 
tributed flaws in the material following a Weibull distribution with 
material-dependent parameters. The lowest activation threshold strain, 
k, derived from the Weibull distribution, is given by x= kV“, where Vis 
the volume ofthe brittle material and kand mare the material-dependent 
Weibull parameters. We adopt m= 9.5 for pressure-dependent failure”. 
The volume is calculated given the dimensions of the MU,, binary. 
From the material strength parameters K and G, Young’s modulus 
Emay be calculated as F = (9KG)/(3K + G). Here K = 2.67 x 10° Pa, which 
is the leading term in the Tillotson EOS, and G=1.6 x 10° Pa. Finally, for 
undamaged material, x = Y,/E, where Y; is the tensile strength givenin 
table 30 of ref. **. k may thus be extracted and is k= 10“, 2 x 10 and 
2 x10” m? for the low-, medium- and high-strength-material setups, 
respectively. Damage accumulates when the local tensile strain reaches 
the activation threshold of a flaw. 

For the plasticity model we use a pressure-dependent yield strength” 
following the implementation of ref. *. The yield stress Y, is different 
for damaged and intact material. For intact material, the yield stress is 
Y,=Yo+ pP/(1+ :P/(Yu— Yo)), where Yq is the cohesion (again see table 
30 of ref. ”), u,is the coefficient of friction and Yy is the shear strength 
at P=, We adopt py; = 1.5 (ref. “) and a typical Yy,=1.5 x 10° Pa (ref. ©), 
which is appropriate for an object composed of ice, rock and organ- 
ics. For P=0, we recover the pressure-independent form Y, = Y,. For 
damaged material the yield stress is ¥,=1gP, where 1, is the coefficient 
of friction of the damaged material. Here we take 1, = 0.6, following 
ref.“ and thus fully damaged particles still undergo some shear stress. 

Extended Data Fig. 1 shows additional results of our simulated 
impacts. We obtain the rotation period of MU,, using the nominal den- 
sity of 1g cm? only when using the high-strength-material parameters. 
Medium-strength (Extended Data Fig. 1a) or low-strength (Extended 
Data Fig. 1b) materials deform MU,, and do not produce the observed 
shape of a gently merged contact binary. If the nominal density is 
halved (0.5 g cm’’), the impact velocity v.,, is lower, which produces 
less deformation in our simulations. Even medium-strength material 
parameters generate a gently merged contact binary for virtually all 
impact angles (Extended Data Fig. Ic, d). Here, we used the 55° impact 
angle, for which the observed spin period of MU,, is approximately 
obtained. In Extended Data Fig. 2 the shape is considerably deformed 
after the collision if the impact velocity is larger than v = 10v,,,, using 
high-strength material parameters. The same velocity with weaker 
material parameters leads to a complete disruption of MU,o. 

Our simulations were performed for a grid of impact angles, assum- 
ing pre-alignment of the two lobes. Simulating higher impact angles 
and low- and medium-strength-material compositions causes the two 
lobes to interact in other ways: insome cases they hit and create contact 
craters and then roll on top of each other; in other cases they collide, 
bouncing off each other instead of rolling, and then return to re-collide 
following a (now) shorter orbital period. These formation channels 
may also generate compatible shapes but, so far, not the exact rotation 


period of MU,,. A full investigation of the collision phase space must 
also include the initial self-rotation of each lobe in addition to inclined 
hits. This requires a huge collision phase space, exceeding the scope 
of this work, and necessitates a dedicated hydrodynamical study. Pre- 
liminary results (in preparation) indicate that such an approach may 
yield more channels through which the unique orientation of MU,, 
might be generated, besides the successful cases for pre-aligned binary 
components that are shown here. 

Our standard resolution is 5 x 10° SPH particles. We have additionally 
preformed simulations with 10° and 2.5 x 10° particles. Test simula- 
tions were performed on the TAMNUN GPU cluster at the Technion 
Institute in Israel and production runs on the bwForCluster BinAC at 
the University of Tubingen, Germany. 


Data availability 


The data that support the findings of this study are available from the 
corresponding author upon reasonable request. 
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Extended Data Fig. 1| Additional results of the collision models. a, 40° impact angle, medium-strength material. b, 40° impact angle, low-strength material. 
c,d, Low-density model (0.5 g cm’) with animpact angle of 55° and medium-strength material. The edge (c) and face (d) views are given. 
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Extended Data Fig. 2 | Additional results of the collision models. 5° impact 
angle, high-strength material and large escape velocity, v=10U,... 


Extended Data Table 1| Merger rate of the binaries in the non-secular regime 


ce 
fos-100 0.029 | 0.024 


N,», total number of mergers. 

Ngo-100, number of mergers with inclinations 80° <i <100°. 

f,, total merger fraction normalized to the inclination sampling rate. 

fs9-400, merger fraction for only those mergers with inclinations 80° <i < 100°. 
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Extended Data Table 2 | Crush curve, plasticity and fragmentation parameters 


Type Yr (Pa) 
Low-strength 10° 
Medium-strength 104 
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Unidirectional radiation is important for various optoelectronic applications, such 

as lasers, grating couplers and optical antennas. However, almost all existing 
unidirectional emitters rely on the use of materials or structures that forbid outgoing 
waves-—that is, mirrors, which are often bulky, lossy and difficult to fabricate. Here we 
theoretically propose and experimentally demonstrate a class of resonances in 
photonic crystal slabs that radiate only towards one side of the slab, with no mirror 
placed on the other side. These resonances, which we name ‘unidirectional guided 
resonances’, are found to be topological in nature: they emerge when a pair of 
half-integer topological charges’ *in the polarization field bounce into each other in 
momentum space. We experimentally demonstrate unidirectional guided resonances 
inthe telecommunication regime by achieving single-side radiative quality factors as 
high as 1.6 x 10°. We further demonstrate their topological nature through far-field 
polarimetry measurements. Our work represents a characteristic example of applying 


topological principles*» to control optical fields and could lead to energy-efficient 
grating couplers and antennas for light detection and ranging. 


Topological defects', which are characterized by quantized invariants, 
offer a general description of many exotic phenomena in real space, 
such as quantum vortices in superfluids and singular optical beams’. 
Ithas been recently found that topological defects can also emerge in 
momentum space, giving rise to interesting effects. One such example 
is bound states in the continuum? (BICs) in photonic crystal slabs: these 
guided resonances reside inside the continuous spectrum of extended 
radiating modes, yet counter-intuitively remain spatially confined and 
maintain infinitely long lifetimes. Since they were initially proposed’, 
BICs have been demonstrated ina variety of wave systems® ~° and have 
led to various applications”. Recently, the topological defect nature of 
BICsin photonic crystal slabs was discovered: BICs are vortices of polari- 
zation major axes in momentum space that carry integer topological 
charges”>>, The lack of a continuous definition of polarization at the 
vortex centre forbids the emission of far-field radiation from BICs. So 
far, most BICs have been demonstrated in up-down mirror-symmetric 
structures”, in which the observation of no upward radiation neces- 
sitates the absence of downward radiation. On the other hand, the 
existence of unidirectional guided resonances (UGRs)—resonances 
that radiate only towards one side of a photonic crystal slab, with no 
mirror placed on the other side—has not been confirmed so far”””8. 
Here we theoretically propose and experimentally demonstrate UGRs 
in photoniccrystal slabs that are enabled by topological charges. Specifi- 
cally, we first split the integer topological charge carried by aBICintoa 
pair of half-integer topological charges”’; each charge correspondstoa 
circularly polarized resonance. As the structure is continuously varied, 
the two half-integer topological charges in the downward radiation keep 
evolving in momentum space until they bounce into each other and, 
again, act as an integer charge. At this point, downward radiation from 


this resonance is disallowed because its far-field polarization is again 
undefined—this is the topological origin of UGRs. Because up-down 
mirror symmetry is broken, the UGRs can still radiate towards the top 
side, unlike traditional BICs®, making them potentially useful as low-loss 
grating couplers to efficiently couple light both on and off chips. 


Numerical design and topological interpretation 
Asaspecific example, we consider a one-dimensional periodic photonic 
crystal slab in which infinitely long bars with gaps of width w=358 nm 
are defined in a500-nm-thick silicon layer with refractive index of 
n=3.48 ata periodicity of a=772 nm (Fig. la—d). Both the top and bot- 
tom silica cladding layers (n = 1.46) are assumed to be semi-infinitely 
thick. When the sidewalls of the bars are vertical (@ = 90°; Fig. 1b), the 
photonic crystal slab is up-down and left-right symmetric, and a BIC 
is found on a transverse electric (TE)-like band (TE1) along the k, axis 
offthe normal direction at k,a/(21t) = 0.176. In this up—down-symmetric 
structure, the radiative decay rate of amode towards the top (y,; orange) 
of the photonic crystal slab is always the same as that towards the bot- 
tom (y,; blue), and both rates are reduced to O at the BIC (middle panels, 
Fig. 1b). Fundamentally, this BIC is a topological defect in the far-field 
polarization major axes that carries an integer topological charge of 
q=1, defined as: 


q= §. dk-Y.p(h) (a) 


Here @(k) is the angle between polarization major axis and the x axis and 
kis the in-plane wave vector. Cis a yellow closed path in Fig. 1b, which 
goes around the BIC in the counter-clockwise direction. 
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Fig. 1| UGRs and their topological nature. a, Schematic of a photonic crystal 
slab supporting a unidirectional guided resonance (UGR). When the sidewall is 
vertical, aBIC is found ona TE band, labelled by red circles in the right panels. 
Q, accounts for radiative loss towards both top and bottom, Q,=@/(y, + yp). 

b, When the sidewall is vertical, radiative losses from resonances towards the 
top (y,; orange line) are equal to those towards the bottom (y,; blue line) owing 
to up-down symmetry, and both reduce to Oat the BIC. The polarization major 
axes wind around the BIC and have a topological charge of q=1.c, When the 
sidewall is tilted from the vertical direction, the g=1 topological charge splits 
into a pair of half charges (gq =1/2) with opposite helicities, LCP (red) and RCP 
(green). d, When the sidewall angle 6 changes to 75°, a UGR is achieved: 
radiation towards the bottom is eliminated (y, = 0) while radiation emitted 
towards the top (y,) remains finite. e, Trajectories traced by the two half 
charges (red and green) inmomentum spaceas 6is varied. 


When one of the sidewalls is tilted away from the vertical direction 
(8 = 81°; Fig. 1c), the photonic crystal slab is no longer up-down sym- 
metric, and y, and y, are no longer simply related. No BIC exists in this 
structure any more; the radiative decay rate towards the top or the 
bottom (y,,) never reaches O (middle panel). On the other hand, the 
total winding of the polarization major axes remains +27 because the 
winding number is a conserved quantity. Consequently, the integer 
charge g=1is split into two half-integer charges of g = 1/2, each being 
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Fig. 2| Numerical confirmation of aUGR. a, Unit cell design of a photonic 
crystal slab supporting a UGR that only radiates towards the top, as shown inits 
mode profile (E,). a.u., arbitrary units. b,c, Colour map (b) and line cut (c) 
showing that the asymmetry ratio between upward and downward radiative 
loss, N=Y,/Yp, diverges to infinity at the UGR. The polarization major axes 
(arrows) show a +2 winding around the UGR, whichis consistent with Fig. 1d. 


acircularly polarized resonance (bottom panel). The two half-integer 
charges are related to each other by the y-mirror symmetry of the 
structure, which also guarantees that these two circularly polarized 
resonances are opposite in helicity: left-handed circularly polarized 
(LCP) for one (red) and right-handed circularly polarized (RCP) for 
the other (green). 

When the sidewall is further tilted, the two half-integer charges in 
the downward radiation keep moving in momentum space, following 
the trajectories shown in Fig. le: red for LCP and green for RCP. Neither 
of the radiative decay rates (y,,,) is reduced to 0 until 8 is decreased to 
75° (Fig. 1d), where the LCP and RCP trajectories meet on the k, axis. 
At this point, any downward radiation needs to be both LCP and RCP 
at the same time, which can never be satisfied. As a result, this guided 
resonance cannot have any downward radiation, even without a mir- 
ror on the bottom-—this is what we call a UGR. From the viewpoint of 
topology, UGRs can be understood as the merging point between two 
half-integer charges, where they act like an integer charge, forbid- 
ding any radiative loss. This topological interpretation agrees with our 
numerical simulation results, where y, reaches O whereas yp, remains 
finite (middle panel of Fig. 1d). We note that the lack of certain symme- 
tries in our structure (both C, and up-down mirror) is crucial to achieve 
UGRs; see Supplementary Information sections 1-3 for more details. 

Next, we present our UGR design (Fig. 2a). The photonic crystal 
slab consists of a periodic array of one-dimensional bars defined ina 
500-nm-thick silicon-on-insulator wafer at a periodicity of a=825nm 
(left panel). The top cladding material is air and the bottom cladding 
is SiO. The sidewalls are tilted to specific angles, 0, = 79° and 0, = 75°, 
to achieve a UGR: as shown in the £, mode profile (right panel), the 
downward radiation y, is considerably lower, by more than 70 dB, 
than the upward radiation y,. The asymmetry ratio between upward 
and downward radiation intensity, 7 =y,/y,, is calculated for different 
k points (colour map, Fig. 2b), where the extremely bright spot marks 
the location of the UGR at k,a/(21t) = 0.0854. A line-cut of the colour map 
along the k, axis shows the asymmetry ratio 7 diverging into infinity, 
which is the characteristic feature of unidirectional radiation (Fig. 2c). 


Bottom 


Fig. 3 | Fabricated sample and experimental setup. a, b Scanning electron 
microscope images of the fabricated photonic crystal sample from tilted 
(a) and side (b) views. c, Schematic of the setup used to independently measure 


Overlaid onthe colour map in Fig. 2bis the plot of the polarization major 
axes for the downward far-field radiation from nearby resonances. 
An integer winding of the polarization major axes, g=1, is observed 
around the UGR, which is consistent with the topological interpreta- 
tion presented in Fig. 1d. 


Sample fabrication and experimental setup 


To verify our theoretical findings, we fabricate photonic crystal sam- 
ples with UGRs using plasma-enhanced chemical vapour deposition, 
electron-beam lithography and reactive ion etching (RIE) processes. 
The scanning electron microscope images are shown in Fig. 3a, b. 
Briefly, a thermal SiO, layer with a thickness of approximately 110 nm 
is first deposited on the wafer as the hard mask. Unlike standard RIE 
processes that use horizontal substrates, our sample is placed ona 
wedged substrate that allows us to etch the silicon layer at a slanted 
angle; as aresult, high-quality air gaps with tilted sidewalls are achieved 
(Fig. 3b). Because of the shadowing effect, the angles of the left and right 
sidewalls are not identical: 0, = 79° and 6, =75°. The width of the air gap, 
w, is swept from 320 nm to 340 nm to best capture the UGR design at 
w=331nm. See Methods for more details about the fabrication. 

To demonstrate UGRs, the upward and downward radiative decay 
rates from our fabricated samples are independently characterized 
using the experimental setup schematically shown in Fig. 3c. Atunable 
telecommunication laser inthe C+L band is first sent through a polarizer 
in the x direction before it is focused by a lens (L1) onto the rear focal 
plane of an infinity-corrected objective lens. To achieve on-resonance 
coupling, the incident angle is tuned for each excitation wavelength 
Aby moving L1in the x-y plane, exciting a resonance in the sample. 
Each excited resonance radiates towards the top (bottom) according 
toits radiative decay rate y, (y,) into this channel. Upward (downward) 
far-field radiation from this resonance is then collected by the confocal 
setup shown on the right (left), marked with a orange (blue) dashed 
box, where the beam is shrunk by 0.67 times through a 4f system to 
best fit the camera. This on-resonance excitation scheme is similar to 
previously reported results*°™. See Methods for more details on the 
experimental setup. 


Experimental results 


As an example, the experimental comparison between upward and 
downward radiation froma resonance at A=1,551 nm is shown in Fig. 4a. 


the upward and downward radiation intensities from the guided resonances in 
the photonic crystal sample. L, lens; Obj, objective; RFP, rear focal plane; PD, 
photodetector; POL, polarizer; BS, beam splitter; 4f, relay 4f optical system. 


Here, the excitation laser is on resonance with a mode on the k, axis at 
k,a/(2t) = 0.01. Momentum space is labelled with respect to the known 
numerical aperture of the objectives (NA = 0.26), shownas white circles. 
The characteristic feature of the UGR—marked by a white arrow on 
the k,. axis—is qualitatively shown in the comparison between the two 
figures: for resonances near the white arrow, the downward radiation 
(X’, YZ’) is always much weaker than the upward radiation (X, Y, Z). 
On the other hand, for resonances far from the UGRs (for example, to 
the left of the k, axis), the upward and downward radiation are compa- 
rable. We note that although UGRs radiate only towards a single side 
(top), their in-plane propagation is not immune to back-scattering from 
fabrication disorder suchas the chiral edge states in a Chern insulator, 
because our structure is reciprocal. 

A more quantitative demonstration of the UGRs is achieved by 
measuring the up-down asymmetry ratio 7 = y,/y, of the resonances. 
Two movable pinholes (not shown in Fig. 3c) with diameters of 300 pm 
are placed at the image planes of the rear focal planes of the objec- 
tives to select specific k points. Three examples are shown in Fig. 4b, 
where upward (X, Y, Z) and downward (X’, Y’, Z) radiation intensities 
are measured by two photodetectors as the excitation wavelength 
scans through the three resonances. As expected, all measured spec- 
tra exhibit symmetric Lorentzian features”: the excitation efficiency 
reaches its maximum when the excitation is on resonance, which hap- 
pens atA=1,553.7 nm, 1,551.2 nm and 1,549.4 nm. Accordingly, both the 
central wavelengths and the total quality factors of the resonances 
can be extracted by fitting the experimental results. By repeating this 
procedure for all resonances along the X-Zline, we achieve good agree- 
ment between experiments (red crosses) and numerical simulation 
(blue line, Fig. 4c). 

We further measure the downward radiative decay rate of the reso- 
nances, y, = @/Q,, and show that it is reduced to O at the UGR. Here, w 
is the resonance frequency and Q, is the radiative quality factor that 
accounts only for the downward radiation. In practice, the observed 
total loss y,.,= @/Q,., is composed of non-radiative loss Yron-rad = @/Qnon-rad 
(including absorption, scattering, and lateral leakage), as well as radia- 
tive losses towards the top and bottom: 


Not = Yron-rad : K - Yb (2) 


Because these resonances are close in momentum space and share 
similar mode profiles, it is reasonable to assume that they share a similar 
non-radiative quality factor, which is found to be Q,,,.a4= 2,080 through 
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Fig. 4| Observation of UGRs. a, Upward and downward radiation intensities 
from resonances under an excitation wavelength of 1,551nm. In the vicinity of 
the UGR onthek, axis, marked by a white arrow, the downward radiation 
intensities (X’, Y’, Z’) are considerably suppressed compared to the upward 


numerical fitting (see Methods for details). Upward and downward 
radiative decay rates can be further separated based onthe measured 
asymmetry ratio 7 = y,/y, (see Extended Data Fig. 3 for measurement 
results of 7). Experimentally extracted Q, values are presented in Fig. 4c 
as red crosses, showing good agreement with the numerical simulation 
results (blue line). In particular, the fact that the bottom radiation y, is 
reduced to almost 0 at k,a/(211) = 0.088 proves the existence of UGR. 

To demonstrate the topological nature of UGRs, we perform pola- 
rimetry measurements” ona series of five samples with slightly dif- 
ferent widths w. For each sample, we experimentally locate the two 
half-integer charges in momentum space (symbols in Fig. 5a), which 
show good agreement with the simulation results (dashed lines). 
See Methods for more experimental details of the polarimetry meas- 
urements. The perfect design with a UGR is marked with an arrow. As 
shown, when w increases from 0.399a (marked by inverted triangles) 
to 0.403a (diamonds), the two half-integer charges switch positions. 
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Fig. 5| Observation of the topological nature of UGRs. a, Experimentally 
measured trajectories of half-integer charges from five samples with different 
widths w (symbols), showing good agreement with simulation results (dashed 
lines). b, c, The LCP and RCP resonances switch positions as w increases from 
0.4a (b) to 0.4034 (c), with the merging point being the UGR. The colour map 
shows the measured ellipticity of the downward radiation fields. 
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radiation (X, Y, Z).b, Upward and downward radiation intensities from 

the resonances as the excitation wavelength scans from1,535nmto1,565nm. 
c,d, Experimental results (red crosses) of the band structure (c) and Q, (d), 
showing good agreement with the simulation results (blue lines). 


This switching behaviour is further confirmed by measuring the 
ellipticity p of the far-field polarization: when w increases from 0.4a 
(Fig. 5b) to 0.403a (Fig. 5c), the LCP (RCP) resonance, shown in red 
(green), moves from the top (bottom) to the bottom (top) half of the 
momentum space. Taken together, these experimental results confirm 
our topological interpretation shown in Fig. 1d: UGRs arise when two 
half-integer charges with opposite helicities bounce into each other 
in momentum space. 

To summarize, we present a type of resonance, which we call UGR, 
that radiates only towards the top of a photonic crystal slab, even 
without a bottom mirror. We experimentally demonstrate their exist- 
ence by showing that the downward radiation field vanishes. Through 
polarimetry measurements, we further demonstrate the topological 
nature of these resonances as the merging point between half-integer 
topological charges. Owing to their unique properties, UGRs could be 
used as energy-efficient grating couplers (see Methods for discussion) 
with further applications in photonic-crystal surface-emitting lasers, 
light detection and ranging antennas. 
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Methods 


Numerical simulation details 

All simulations are performed using the COMSOL Multiphysics 5.2 
‘Radio Frequency’ module in the frequency domain. Two-dimensional 
models (in the x-y plane) are created to simulate one-dimensional pho- 
tonic crystal slabs with perfectly matching layers along the y direction. 
Periodic (Floquet) boundary conditions are applied in thex direction. 
The meshing resolution is set to adequately capture resonances with Q 
values of up to 10°. The eigenvalue solver is used to calculate the band 
structures and quality factors. To calculate the asymmetric radiation 
ratios 7, two surface probes—one above and one below the structure— 
are used to calculate the radiative fields towards both sides. 


Sample fabrication 

Thesampleis fabricated ona single-side polished silicon-on-insulator 
wafer with a silicon layer thickness of 500 nm, a silica layer of 2 um and 
a silicon substrate of ~ 725 pm. The step-by-step fabrication process 
is illustrated in Extended Data Fig. la. The wafer is first cleaved into 
lcm x 1cmchips and cleaned. The photonic crystal pattern is then 
defined using electron-beam lithography. A 110-nm-thick SiO, film is 
thermally deposited using plasma-enhanced chemical vapour deposi- 
tion as the hard mask. A layer of ZEP520A photo-resist (340 nm thick) is 
spin-coated onthe SiO, layer. The photonic crystal patterns are defined 
in the photo-resist layer (Elionix FLS-125), which is then developed 
using o-xylene (>95%). The chip is then placed ona customized wedged 
holder made of Al,O, witha slanted angle of 26° for RIE. The photonic 
crystal patterns are first transferred onto the SiO, hard mask using CF,, 
and are then transferred onto Si using Cl, gas. The residue hard mask 
is removed using BHF wet etching. Finally, the bottom side is treated 
with chemical-mechanical polishing for the measurements of the 
bottom radiation fields. The size of the sample is 500 ppm x 500 um. 
Owing to the shadowing effect, the tilt angles of the two sidewalls are 
not identical, as illustrated in Extended Data Fig. 1b. To best capture 
the UGR design at w = 331 nm, the width of the air gap is varied from 
320 nm to 340 nm. 


Measurement of asymmetry ratio 7 and single-side quality 
factors 

As illustrated in Extended Data Fig. 2, ax-polarized tunable telecom- 
munication laser (Santec TSL-550, C+L band) is sent through a chopper 
for lock-in detection and is focused by a lens (L1) onto the rear focal 
plane of the objective to define the excitation angle. Two identical 
arms are used to measure the two radiation fields from the top and 
the bottom. In each arm, a two-stage 4f system is used to adjust the 
magnification ratio. After passing through an orthogonal polarizer 
in the y direction, the radiation field is collected using a photodetec- 
tor and a camera. To measure radiation fields from resonances from 
a specific k point, two movable pin holes with diameters of 300 pm 
are placed at the Fourier planes to select the desired k point. Upward 
and downward radiation fields go through two identical pin holes and 
are then measured using two identical photodetectors (PDAIODT-EC). 
Each photodetector is connected to a lock-in amplifier (SRS SR830). 
A flip mirror is used to switch between the camera that images the 
light-scattering patterns and the photodetector. 

A ‘cross-polarization filtering’ technique is used to suppress 
unwanted reflections, similarly to some previous works’. Specifi- 
cally, unwanted reflections (caused by lenses or other optical surfaces) 
mostly maintain the incident polarization, whereas most radiation 
fields from guided resonances do not. By placing two orthogonal polar- 
izers in the optical path along the x axis (for excitation) and y axis (for 
observation), unwanted reflection is greatly suppressed. This setup 
also transforms typical asymmetric Fano lineshapes into nearly sym- 
metric Lorentzian lineshapes. An ‘on-resonance pumping’ technique 
is also used inthe setup, similarly to previous works™. As the photonic 


crystal structure shows little dispersion along k, but strong dispersion 
along k,, the scattering patterns are almost straight lines parallel to 
the x axis. 

The central wavelengths (Fig. 4c) and total quality factors (Q,,,; 
Extended Data Fig. 3a) of the guided resonances are extracted by 
numerically fitting the measured spectra; examples are shown in 
Fig. 4b. As both upward and downward radiation fields are measured 
in our setup, the ratio between upward and downward decay rate, n, 
is achieved directly. The observed total quality factor Q,,, includes 
contributions from: (1) non-radiative losses due to material absorption, 
scattering from surface roughness and in-plane lateral leakage and (2) 
radiative losses due to upward and downward radiation. This relation- 
ship can be written as: 


1 1 1 


Qeot . O gastd ‘i Q, os 


The radiative losses can be further separated into top and bot- 
tom channels: 1/Q,=1/Q, + 1/Q,. As shown in Extended Data Fig. 3a, by 
comparing the measured Q,,, (blue crosses) to the calculated radia- 
tive quality factors from an ideal unidirectional design (red line), 
Qnon-rad = 2,080 is extracted. By design, Quon-raq is much larger than Q, 
(200 to 600), so the energy loss is dominated by radiation. Further- 
more, the simulation results from disordered designs (green circles) 
are presented as a reference, where the air-gap locations and widths 
fluctuate with a standard deviation of 1Inm. Using the measured asym- 
metry ration =y,/y, = Q,/Q,, equation (3) can be written as 
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Qrot 


re a 
Oo eas Q, 


(4) 


Using this relationship, the single-side quality factors Q, canbe cal- 
culated accordingly. 


Polarimetry measurement setup 

To demonstrate the topological nature of UGRs, we perform pola- 
rimetry measurements on the downward radiation fields. The experi- 
mental setup is schematically shown in Extended Data Fig. 4. Unlike 
the previous setup, which uses a continuous-wave tunable laser, this 
setup uses a broadband amplified spontaneous emission light source 
with a centre wavelength of 1,550 nm, a bandwidth of 40 nm and an 
output power of 10 dB m. The incident light excites the sample along 
the k, axis, and the incident angle is varied between -1.3° and 1.3° at 
a step size of 0.3°, which is controlled by lens L1. Owing to the broad 
bandwidth of the excitation, all resonances at a given incident angle 
are excited. 

Astandard polarimetry measurement is then performed on the scat- 
tered light to determine the polarization state of each resonance. Spe- 
cifically, the scattered light intensity is measured after passing through 
six configurations of a polarizer and a quarter-wave plate (QWP): (1) no 
QWP, polarizer oriented along the x axis; (2) no QWP, polarizer along 
the yaxis; (3) no QWP, polarizer at 45° with respect to the x axis; (4) no 
QWP, polarizer at 135° with respect to the xaxis; (5) QWP fast axis at 45° 
with respect to the x axis, polarizer along the y axis; (6) QWP fast axis 
at 135° with respect to the x axis, polarizer along the y axis. This set of 
measurements allows us to fully reconstruct the polarization state of 
each resonance® through the Stokes parameters: 


So= IEP + [6] 
2 
Si=|E) ~ IEP ie 
S,=2IE,E,|cos(A6) 
53 =2IE,E,|sin(A6) 


Here, E=E,el"é, + E,e'”*4°@, . Specifically, the ellipticity p = 5,/So is 


maximized (+1) or minimized (-1) when |E,|=|£,| and Aé= +1t/2, which 
correspond to the LCP and RCP resonances, respectively. This allows 
us to locate the half-integer topological charges g=+1/2in momentum 
space by measuring the maximum and minimum ellipticity p of the 
scattered light. 

The ellipticity measurement results for five samples are shown in 
Extended Data Fig. 5. Allsamples share the same design, except for the 
air-gap width w, which varies between 0.399a (Extended Data Fig. 5a) 
and 0.403a (Extended Data Fig. 5e). As w/ais varied, the two half-integer 
charges (with opposite ellipticities) approach and bounce into each 
other before they move apart. The transition point corresponds tothe 
UGR design, whichis confirmed by the switching of the ellipticity before 
and after the transition; namely, LCP (RCP) is initially in the top (bottom) 
half plane, as in Extended Data Fig. 5a, and moves to the bottom (top) 
at the end, as in Extended Data Fig. Se. These experimental results are 
in good agreement with the simulation results (Supplementary Fig. 4) 
and with our topological interpretation presented in Fig. le. 


Robustness of the UGRs to fabrication errors 

In practice, fabricated samples inevitably deviate from their designs 
because of fabrication errors or imperfections. Here we analyse the 
factors limiting the performance of UGRs in realistic samples. The perio- 
dicity of photonic crystal is limited by the accuracy of the electron-beam 
lithography; however, this is often not the limiting factor. In compari- 
son, it is more challenging to fabricate the air gaps (both width and 
tilt angles) exactly as designed, owing to the accuracy of the etching 
processes. First, we assume that the fabricated sample deviates stead- 
ily from the ideal design (a = 825 nm, w = 352 nm, 0, = 79°, 6, = 75°) in 
terms of (1) air-gap width, Aw=+2.5 nm and (2) sidewall angle, A@=+1°. 
The simulation results in Extended Data Fig. 6a, b confirm that when 
the parameters are slightly different from the ideal design, the asym- 
metry ratio remains high, as expected. Furthermore, owing to the 
topological nature of UGRs, a fixed deviation in one parameter can be 
compensated by another parameter to restore the perfect elimination 
of downward radiation fields. For example, as shown in Extended Data 
Fig. 6c, a change of A@ = -—1° in the etching angle can be compensated 
by changing the air-gap width from w = 352 nm to w= 365 nm, where 
the UGRis restored. 

Meanwhile, random fluctuations are also inevitable in fabricated 
samples and they induce scattering losses and lower the asymmetry 
ratios. In this part of the analysis, we assume that the tilted angles are 
fixed while the air-gap locations and widths fluctuate randomly from 
the ideal design with a standard deviation of 1 nm, which is estimated 
from the scanning electron microscope images. The average Q,,, and 
asymmetry ratios for disordered samples are obtained from simula- 
tions, which are compared with the ideal design and the experimental 
results, as shown in Extended Data Fig. 3b. Q,,, drops owing to scattering 
losses. The asymmetry ratio is reduced to approximately 50 dB at its 
peak (from 70 dB) but remains higher than 35 dB in the vicinity of the 
UGR, demonstrating that our designis robust to fabrication errors and 
uncertainties. The ‘cross-polarization filtering’ technique also allows 
us to measure the asymmetry ratio for any k points. As confirmed by 
simulation and experimental results (Extended Data Fig. 7), the asym- 
metry ratios remain higher than 35 dB as the excitation deviates from 
k, = 0 to k,a/(2mt) = 0.06. This provides a 6° tolerance in the polishing 
angle of the angled fibre couplers, which is reasonable in practice. 


Prospects of using UGRs as grating couplers 

Highly directional radiation is desirable in on-chip optoelectronic 
devices suchas lasers, LIDAR antennas and grating couplers. Although 
grating couplers having been studied extensively, their performances 
are still not optimal, with one major challenge arising from unwanted 


downward radiation losses towards the handle wafer side**””. Sev- 
eral mechanisms have been proposed to achieve highly directional 
radiation, including non-resonant blazed gratings and resonance-based 
dual-layer gratings. Some relevant works*®** “4 are listed in Extended 
Data Table 1 for acomparison with our work, whichis based ontopology. 
The measured asymmetry ratio reaches a maximum of 27.7 dB; namely, 
99.8% of the radiation field is upward and 0.2% is downward (Extended 
Data Fig. 8). Near the UGR, strong suppression of the downward radia- 
tionis achieved across a reasonably broad bandwidth: over 90% of the 
upward radiation energy is maintained within a 26 nm bandwidth from 
1,536nmto 1,562 nm, as shown in Extended Data Fig. 8a. Furthermore, 
we achieve robust suppression of downward radiation at different 
out-coupling angles between 5° and 11°, as shown in Extended Data 
Fig. 8b. Although we have not fully characterized the fibre-to-waveguide 
losses for our design, the UGRs that we demonstrate here naturally 
eliminate downward radiation and provide a practical and effective 
method to suppress downward radiative losses. 


Data availability 


The datasets generated and analysed during the current study are 
available from the corresponding author upon request. 
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Extended Data Fig. 1| Sample fabrication. a, Step-by-step flow chart of the fabrication process. b, Schematics of the customized RIE process. EBL, 
electron-beam lithography; PECVD, plasma-enhanced chemical vapour deposition; CMP, chemical-mechanical polishing. 
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Extended Data Fig. 2| Experimental setup used to measure the asymmetry respectively. RFP, rear focal plane; PD, photodetector; Obj, objective; Pol, 
ration. The setup is capable of both near- and far-field measurements. The polarizer; Amp, amplifier; BS, beam splitter. Nland N2 denote the movable 
focallengths of lenses L2,L3,L4andL5are150mm,100mm,75mmand75mm, _ lensesusedtoachive near-field imaging. 
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Extended Data Fig. 3 | Experimental and simulation results for disordered 
samples. a, Experimentally extracted Q,,, (blue) compared with simulation 
results for samples with (green) and without (red) disorder. b, Measured 
asymmetry ratio 7 (blue) compared with simulation results for samples with 
(green) and without (red) disorder. 
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Extended Data Fig. 4| Experimental setup used for polarimetry 
measurements. An amplified spontaneous emission (ASE) source excites the 
resonances inthe sample. Scattered light is recorded by acamera under six 
different combinations of a polarizer (Pol) anda QWP. The focal lengths of 
lenses L2,L3,L4 and L5are150 mm,100 mm, 75mmand 75mm, respectively. 
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Extended Data Fig. 5 | Experimental observation of the evolution of slightly different air-gap widths w, ranging from w/a = 0.399 (b) to 0.403 (f). 
half-integer charges. a, UGR as the merging point between two half-integer Dark red (9 =1) and dark green (p=~1) colours indicate the locations of the LCP 


charges. b-f, Measured ellipticity p of the resonances in five samples with and RCP resonances, which are also half-integer topological charges. 
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Extended Data Fig. 6| Robustness of UGRs against parameter variations. 
a, Device performance when the air-gap widths deviate by +2.5nm from the 
perfect design. b, Device performance when the etching angle deviates by +1° 
from the perfect design (grey). c, The UGRis restored if the etching angle 
deviates by -1° from the perfect design and the air-gap width changes to 
w=365nm. 
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Extended Data Fig. 7 | Asymmetry ratio for modes near UGRs. Simulated 
(left) and measured (right) asymmetry ratios 7 for resonances close to the UGR 
inmomentum space. 
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Extended Data Fig. 8 | Prospects of using UGRs as grating couplers. 

a, Asymmetry ratio 7 between upward and downward radiation intensities fora 
fixed out-coupling angle of 9°. The maximum reaches 27.7 dB near the UGRand 
remains high (above 10 dB) over a bandwidth of 26 nm. b, Highly directional 
emission is observed over a wide range of excitation wavelengths and for 
different out-coupling angles. The fibre-to-waveguide loss is not measured. 
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Extended Data Table 1| Comparison of different mechanisms used to achieve highly directional radiation 


Asymmetry ratio (dB) 


Maximum coupling efficiency (%) 


Mechanism 
numerical experimental! numerical experimental 2 

non-resonant blazing effect 8.7 x 80.1 x [38] 
non-resonant blazing effect 20 x up to 99 x [39] 
non-resonant blazing effect 20 > 7.96 up to 99 86.2 [40] 
non-resonant blazing effect 20 x up to 99 x [41] 
dual-layer guided resonance 20 > 10.6 95 92 [36] 
dual-layer guided resonance 20 x 95 x [42] 
dual-layer guided resonance 21 x 99.2 x [43] 
dual-layer guided resonance 8.9 x 70 x [44] 
UGR 70 27.7 x ‘e this 

work 


Data from refs. °°" and from this work. 


‘Lower bound on the measurement value. 
?Not including taper loss. 
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Two-dimensional materials and their heterostructures constitute a promising 
platform to study correlated electronic states, as well as the many-body physics of 
excitons. Transport measurements on twisted graphene bilayers have revealed a 


plethora of intertwined electronic phases, including Mott insulators, strange metals 
and superconductors’ >. However, signatures of such strong electronic correlations in 
optical spectroscopy have hitherto remained unexplored. Here we present 
experiments showing how excitons that are dynamically screened by itinerant 
electrons to form exciton-polarons®’ can be used as a spectroscopic tool to investigate 
interaction-induced incompressible states of electrons. We study amolybdenum 
diselenide/hexagonal boron nitride/molybdenum diselenide heterostructure 

that exhibits along-period moiré superlattice, as evidenced by coherent 
hole-tunnelling-mediated avoided crossings of an intralayer exciton with three interlayer 
exciton resonances separated by about five millielectronvolts. For electron densities 
corresponding to half-filling of the lowest moiré subband, we observe strong layer 
pseudospin paramagnetism, demonstrated by an abrupt transfer of all the (roughly 
1,500) electrons from one molybdenum diselenide layer to the other on application ofa 
small perpendicular electric field. Remarkably, the electronic state at half-filling of each 
molybdenum diselenide layer is resilient towards charge redistribution by the applied 
electric field, demonstrating an incompressible Mott-like state of electrons. Our 
experiments demonstrate that optical spectroscopy provides a powerful tool for 
investigating strongly correlated electron physics in the bulk and paves the way for 
investigating Bose—Fermi mixtures of degenerate electrons and dipolar excitons. 


Van der Waals heterostructures incorporating transition metal dichal- 
cogenide (TMD) bilayers open up new avenues for exploring strong 
correlations using transport and optical spectroscopy. In contrast 
to similar structures in III-V semiconductors, these heterostructures 
exhibit possibilities for exotic material combinations, the creation of 
moiré superlattices exhibiting narrow electronic bands®* “and strong 
binding of spatially separated interlayer excitons”. Recently, transport 
experiments in twisted bilayer graphene demonstrated strongly cor- 
related electron physics in a single system! ®, ranging from supercon- 
ductivity to a Mott insulator state as the filling factor vis varied. In fact, 
this system realizes a two-dimensional (2D) Fermi-Hubbard model on 
a triangular lattice with a tunable electron density. 

In parallel, optical spectroscopy in van der Waals heterostructures 
have revealed the prevalence of many-body hybrid light-matter states, 
termed exciton-polarons®’, in the excitation spectra of electron- or 
hole-doped monolayers. Advances in material quality and device fab- 
rication has led to the observation of moiré physics of non-interacting 
excitons in TMD heterobilayers” ’°. The potential of this new system 
for many-body physics was recently revealed in a demonstration 
of a long-lived interlayer exciton condensate”. Here we describe 


experiments in a heterostructure incorporating a molybdenum dis- 
elenide/hexagonal boron nitride/molybdenum diselenide (MoSe,/ 
hBN/MoSe,) homobilayer that in several ways combine the principal 
developments in these two fields to demonstrate interaction-induced 
incompressible states of electrons. We provide an unequivocal demon- 
stration of the hybridization of inter- and intralayer excitons by coher- 
ent hole tunnelling™’>"* between the two MoSe, layers: the avoided 
crossings in the optical reflection not only show the formation of dipo- 
lar excitons with strong optical coupling but also reveal the existence of 
moiré bands of interlayer excitons. We then demonstrate that intralayer 
exciton-polaron resonances provide a sensitive tool to investigate cor- 
related electronic states in the bulk. Equipped with this spectroscopic 
tool, we observe strong layer pseudospin paramagnetism”” and an 
incompressible Mott-like state of electrons at half-filling of each layer. 


Device structure and basic characterization 

Figure la shows a schematic of the device structure. By using a 
double-gate structure, we control the electric field £, and the chemi- 
cal potential y of the device independently (see Methods for details). 
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Fig. 1| Device structure and basic characterization. a, Schematic of the 
device structure. V,, (V,,) is the applied voltage to the top (bottom) gate. 

b, Schematic of coupled inter- and intralayer exciton (top) and electronsina 
moiré lattice probed by the intralayer exciton (bottom). Purple planes 
correspond to MoSe, layers and dashed lines indicate a moiré lattice. The pink 
(light blue) circles with a+ (-) sign indicate holes (electrons) forming excitons, 
and the electron-hole pair enclosed by the red (yellow) ellipse indicates 
intralayer (interlayer) exciton. The green double arrowin the top panel 
indicates tunnel coupling (¢) of holes througha monolayer hBN barrier. The 
light green circles inthe bottom panel indicate electrons filling the moiré 
lattice. c, Spatial map of integrated photoluminescence from1.59 eV to 1.65 eV. 
The blue and yellow dashed lines indicate the boundary of the monolayer 
MoSe, and MoSe,/hBN/MoSe, regions, respectively. The inset shows 
representative PL spectra of monolayer MoSe, and MoSe,/hBN/MoSe, 
measured at the positions indicated with the blue and yellow stars inthe main 
figure, respectively. 1L, monolayer; 2L, bilayer. 


Figure 1b (top) is a schematic image of a dipolar exciton formed by 
coherent coupling of an interlayer exciton (IX) and an intralayer exciton 
(X) via hole tunnelling. Figure 1b (bottom) shows a sketch of electrons 
ina moiré lattice, probed by intralayer excitons. 

Figure 1c shows a spatial map of total photoluminescence (PL) from 
the device. Here both the top and bottom gate voltages are at zero 
volts. We observe PL from regions with monolayer MoSe,, but not from 
bilayer MoSe,, where two MoSe, flakes are in direct contact (the region 
around the point indicated by the white arrow in Fig. 1c). In contrast, 
the MoSe,/hBN/MoSe, region shows bright PL. This indicates that 
the heterostructure becomes a direct-bandgap system owing to the 
reduction of the interlayer hybridization of the valence bands at the 
I point”, due to the presence of monolayer hBN. Typical PL spectra of 
the monolayer MoSe, and the MoSe,/hBN/MoSe, region are shown in 
the inset of Fig. 1c: there are pronounced intralayer exciton lumines- 
cence peaks in both regions. We observe two distinct exciton peaks in 
the MoSe,/hBN/MoSe, region. This strain-induced energy difference 
between the PL from the top and bottom layers varies across the sample 
(Supplementary Information section 2). 


Coherent interlayer hole tunnelling and dipolar 
excitons 

We first analyse the electric field (F,) dependence of the elementary 
optical excitations of the MoSe,/hBN/MoSe, region in the absence of 
itinerant electrons or holes. To this end, we scan the top and bottom 


gate voltages together to change F, while keeping the system in the 
charge-neutral regime. The obtained PL spectrum is depicted in Fig. 2a: 
using the top (V,,) and bottom (V,,) gate voltage dependence, we deter- 
mine that the PL spectra around 1.632 eV and 1.640 eV stem from an 
intralayer exciton in the top and bottom layer (X,,., and X,,,), respec- 
tively. For high values of |F,| depicted in the top and bottom parts of 
the colour-coded PL spectrum, we observe PL lines with a strong E, 
dependence: we identify these PL lines as originating from interlayer 
excitons with a large dipole moment leading to a sizeable Stark shift. 

The spectra for a positive (negative) V,, regime correspond to the 
interlayer exciton IX, (IX,), which has a hole in the bottom (top) layer 
andanelectroninthetop (bottom) layer. The associated dipole moment 
of the interlayer exciton changes its polarity for V,,~ 0. By extrapolating 
the IX, and IX, PL lines and finding their crossing point, we estimate 
the energy difference between the inter- and the intralayer exciton 
resonances at F,=0, which allows us to determine their binding energy 
difference to be 0.1+ 0.01 eV (Supplementary Information section 6). 

Figure 2b shows the differential reflectance (AR/R,) spectrum 
obtained for the same range of gate voltage scan as that of Fig. 2a. Here 
AR/Ro= (R- Ro)/Ro, With R and R, denoting the reflectance signal from 
the MoSe,/hBN/MoSe, region and background reflectance, respec- 
tively. In accordance with the PL data (Fig. 2a), we find X,,, and Xpo 
resonances around 1.632 eV and 1.640 eV, respectively. Moreover, for 
Vig? 7.5 V (VigS -7.5 V), we observe IX, (IX,) hybridizing exclusively with 
Xtop (Xbor). Figure 2c, d shows magnified plots of the regions highlighted 
with blue and green dashed lines in Fig. 2b, confirming avoided crossing 
of anintralayer exciton with multiple interlayer excitons. We first note 
that the observation ofa sizeable reflection signal from IX, away from 
the avoided crossing suggests that it is possible to resonantly excite 
long-lived interlayer excitons in these structures. The hybridization of 
IX, lines with X,,,, together with the lack of an avoided crossing with 
Xpor in Fig. 2c, unequivocally shows that avoided crossings originate 
exclusively from the coherent hole tunnelling schematically shown 
in Fig. 2e. Our observation, proving that the hole tunnel coupling is 
much larger than that of the electron, is consistent with the band align- 
ment expected from first-principles band-structure calculations”. This 
conclusion is also confirmed by the data depicted in Fig. 2d, where 
avoided crossing originates from the coherent-hole-tunnelling-induced 
hybridization of IX, and X,,, schematically shown in Fig. 2f. 

One ofthe most remarkablefeatures of thespectradepictedin Fig. 2c,d 
is the existence of multiple avoided crossings associated with three 
interlayer exciton resonances separated in energy by about 5 meV. This 
interlayer exciton fine structure demonstrates the existence of amoiré 
superlattice®°"'?""*, originating from a small twist angle between the 
two MoSe, layers (Methods). The presence of an hBN tunnel barrier 
strongly suppresses the strength of the associated moiré potential, 
rendering it sizeable for only the interlayer excitons®. 


Charge configuration detection by exciton-polaron 
spectroscopy 

The presence of itinerant charges drastically alters the optical excitation 
spectrum”, Recent theoretical and experimental work established that 
the modified spectrum originates from dynamical screening of excitons 
by electrons or holes®’, leading to the formation of a lower-energy 
attractive polaron (AP) branch. Concurrently, the exciton resonance 
evolves into a repulsive polaron (RP) (Supplementary Information 
section 1). The strong sensitivity of the RP resonance energy to changes 
in electron density renders it anideal spectroscopic tool for sensing the 
electron density ninthe same layer”. The strain-induced resonance 
energy difference between X,,, and X,,, ensuring different energies 
for the corresponding RP,,, and RP,,,, together with the much weaker 
sensitivity of RP,., (RPyo.) to the electron density n, (n,) inthe bottom 
(top) layer, allows us to determine the charging configuration of the 
two layers simultaneously. As we are predominantly interested in the 


Nature | Vol580 | 23 April 2020 | 473 


Article 


a Energy (eV) c Energy (eV) e 
1.64 1.62 1.63 1.64 a0 
9.0 | Electric field 
7.5 CB 
= Q Ss 85 Ix g 
® + <— T 3 
a fe} g2 fe} 
=) 3 9 8.0 -7.0 3 
a Q @Q 
° yp 68 a 
F oO e° o Ex 
Ss 75 65 < B 
VB 
7.0 Xtop 
-6.0 MoSe, han MoSe, 
——>_ | to bottom 
10! 10? 108 104 10§ -1.0 -0.5 0 0.5 1.0 oe , : 
PL (counts s~) AR/Ry 
b Energy (eV) d Energy (eV) f 
1.60 1.62 1.64 1.66 1.63 1.64 1.65 
; Electric field 
Energy 
CB 
= 2 2 A 
© o s 
£ g 3 kg 
2 o 2 é Xpot 
e o = E,& “@-- A, Ex 
Ss = B 
F VB 
-10.0 


Fig. 2 | Electric field dependence of PL and differential reflectance at 
charge neutrality. a, b, Gate dependence of PL (a) and differential reflectance 
(b) of MoSe,/hBN/MoSe,. Top and bottom gate voltages are scanned together 
totune the electric field at aconstant chemical potential (scanned along the 
dashed line L4 shown in Fig. 3c). The intensity of the PLinais shown onalog 
scale.c, d, Magnified plots of b. The corresponding region ofc and dis 
indicated by the blue and green dashed rectangles inb, respectively. 


low-carrier-density regime where the quasiparticle (bare exciton) 
weight of the RP resonance is close to unity, we will refer to it as the 
exciton resonance. 

Figure 3a, b shows the gate voltage dependence of AR/R, at 
E=1.632 eV and £ = 1.640 eV, which correspond to the top (X,,,) and 
bottom (X,,,) intralayer exciton resonance energy in the charge-neutral 
regime, respectively. The insets to these figures showa line cut through 
the dispersive neutral exciton reflection spectrum, indicating the exci- 
ton energies at which we monitor AR/R). As a small increase of elec- 
tron density from about 0 to 1 x 10" cm’ results in a change of AR/R, 
from about -1 to 20, the blue regions in Fig. 3a, b correspond to the 
charge-neutral regime of each layer. The red and white regions in turn, 
correspond to the electron- or hole-doped regime of each layer. This 
all-optical determination of the charge map of the bilayer provides 
aninvaluable tool for monitoring the bulk properties of 2D materials. 

To enhance the sensitivity of the charge map and to visualize the 
charge configuration of both layers at the same time, we evaluate and 
overlay the derivative of AR/R, (d(AR/R,)/dE) obtained for both layers 
(see also Supplementary Information section 7). The resulting charge 
map, depicted in Fig. 3c, is closely reminiscent of the charging diagram 
used to characterize gate-defined quantum dots”. The blue regions 
in Fig. 3c correspond to gate voltages where the charge configuration 
changes, allowing us to clearly separate the regions (t, b) where the top 
or bottom layer is neutral (t =i or b=i), electron doped (t=n or b=n) 
or hole doped (t=porb=p). 

We show the typical gate voltage dependence of AR/R, in Fig. 3d,e 
obtained while scanning the two gates along the lines L1 and L2, indi- 
cated in Fig. 3a, b, respectively. In both plots, we confirm the emergence 
of the AP resonance and the associated blueshift of the exciton or RP 
energy around the charge configuration transition points, confirming 
the assignment obtained from d(AR/R,)/dE in Fig. 3c. 
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e, f, Schematic of the energy bands and the exciton energy alignment under 
electric fields where the excitons are in resonance. CB and VB denote the 
conduction and valence bands, respectively, and F, denotes the exciton 
binding energy. Green double arrows represent hole tunnelling. Panels eandf 
are the cases for opposite direction of electric field, and correspond tocandd, 
respectively. 


Figure 3f shows the gate voltage scan along L3, indicated in Fig. 3c, 
where we fixed V,, at 4 V and scanned V.,. As we increase the chemical 
potential by sweeping V,, from negative to positive, we find that initially 
the bottom layer gets electron doped, because the (single particle) 
conduction band energy of the bottom layer is lower than that of the 
top layer for the chosen V,,. For V.g~ 4 V, electrons are introduced into 
the top layer as well. In the absence of interactions, we would expect 
the electron density in the bottom layer to become independent of a 
further increase of V,,, due to screening of V,, by the free electrons in 
the top layer. We observe different behaviour in Fig. 3f: increasing V,, 
results ina decrease and eventually total depletion of electrons fromthe 
bottom layer. The underlying physics is understood by considering the 
intralayer exchange interaction, whichis maximal ifall electrons occupy 
asingle layer. Ifthe ensuing reduction in total repulsive Coulomb energy 
exceeds the kinetic energy cost of having all electronsina single layer, 
layer polarization is favoured, leading to the observed depletion of 
electrons from the bottom layer. This phenomenon, termed negative 
compressibility, was previously observed in transport experiments in 
bilayer semiconductor systems”. 


Interaction-induced incompressible states 


The results we present in the ‘Coherent interlayer hole tunnelling and 
dipolar excitons’ section establish the existence of a moiré superlattice 
for interlayer excitons. The absence of coherent electron tunnelling, 
however, indicates that the corresponding electronic moiré subbands 
in the top and bottom layers do not hybridize. Therefore, our homo- 
bilayer realizes a rather unique system exhibiting flat bands with layer 
and valley-spin degrees of freedom. Moreover, negative compressibility 
(‘Charge configuration detection by exciton-polaron spectroscopy’ 
section) indicates that electron-electron interactions are prominent 
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Fig. 3 | Gate dependence of differential reflectance spectrum of MoSe,/ 
hBN/MoSe,. a, b, Two gates dependence maps of differential reflectance 
around the top (a) and bottom (b) intralayer exciton resonances (F=1.632 eV 
and F=1.640eV, respectively). The insets of aand bshow the differential 
reflectance spectrumat (V,,, Vg) = (0 V, 0 V) (indicated with the white stars in 
the maps). The magenta and cyan dots in the insets indicate the points where 
E=1.632 eV and F=1.640 eV, respectively. c, Charge configuration diagram 
obtained by derivative of the differential reflectance spectrum with respect to 
energy (sum of the derivatives at F=1.632 eV and E=1.640 eV). The charge 
configuration for each layer is indicated by p, iandn, which correspond to hole 


even at relatively high electron densities (n ~ 1 x 10” cm) where 
several moiré bands in one layer are occupied. In this section, we explore 
electron correlation effects by focusing on the low-n regime of the 
charging map (Fig. 3c) where the (i, i), (i, n), (n, i) and (n, n) regions 
coalesce. The high sensitivity of the exciton/RP resonance energy, as 
well as the AP oscillator strength, to changes in electron density once 
again forms the backbone of our investigation. 

Figure 4a shows the gate voltage dependence of AR/R, at energy of 
1.6320 eV. The choice of the energy at which we monitor AR/R, maxi- 
mizes the sensitivity to the shift of X,,, and is indicated by the magenta 
pointin the inset of Fig. 4a. We nowchoose the horizontal and vertical 
voltage axes to be V-=0.5V,,— 0.5V,gand V,, = (7/15) Vig + (8/15) Vig. With 
this choice, vertical (V,, axis) and horizontal (V; axis) cuts through the 
reflectance map leave £, and 7 unchanged, respectively. Remarkably, 
Fig. 4a shows a periodic modulation of the reflectance asa function of 
V,» particularly in the low-n regime. Moreover, the modulation of the 
Xtop and X,,, reflectance are correlated and symmetric with respect to 
the V,=-1V axis (Supplementary Fig. 4), indicating that for this value 
of V,, the conduction band edge energy of the two layers is aligned. 

To gain further insight, we first determine the repulsive polaron 
resonance energy (EX, 2 by fitting the reflectance spectrum witha 
dispersive Lorentzian line shape (Supplementary Information sec- 
tion 3) and then plot the derivative of Ey, with respect to V, in Fig. 4b. 
The resulting map shows a remarkable checkerboard pattern that is 
complementary for the top and bottom layers (Supplementary Fig. 4). 


doped (p), neutral (i) and electron doped (n), and shown inthe order of 

(top, bottom). d-f, Gate dependence of differential reflectance along the 
dashed lines L1(d), L2 (e) and L3 (f) shown in a-c. Magenta and cyan dashed 
lines indicate the top (E=1.632 eV) and bottom (E=1.640 eV) exciton resonance 
energies, respectively. APC andRPy stand for intralayer attractive and repulsive 
polarons, where L is ‘top’ or ‘bot’ stands for top or bottom layer, and Cis + or- 
stands for hole or electron as Fermi sea carriers. Charge configuration is 
written in green together with the green dashed lines indicating the charge 
configuration transition point. 


As the blueshift of X,,, (X,,.,) resonance while increasing V,, corresponds 
to filling of electrons in the top (bottom) layer, the complementary 
checkerboard patterns in Fig. 4b and Supplementary Fig. 4f indicate 
a layer-by-layer filling of electrons”. 

The observed periodicity in Fig. 4 indicates the existence of moiré 
subbands for electrons. In anticipation of the subsequent discussion, 
we define a layer filling factor v, (where L is ‘top’ or ‘bot’, indicating 
top or bottom layer, respectively) such that v, = 1/2 corresponds to 1 
electron per moiré unit cell of a single layer, and a total filling factor v 
aS V= Viop + Vbot- From a capacitive model of our device, we determine 
that v=1/2 coincides with electron density of n=2 x10" cm”. At this 
low electron density, the r, parameter, which describes the ratio of 
interaction energy to kinetic energy, is estimated to ber, 214. The den- 
sity periodicity corresponds toa moiré superlattice lattice constant of 
A moire = 24 nm by assuming a triangular superlattice (see Methods for 
the estimation ofr, and A,,oi¢). We indicate the values of vcorrespond- 
ing to v=1/2, 1, 3/2, 2 with blue dashed lines in Fig. 4b. We confirmed 
that the same periodicity also appears in the AP photoluminescence 
intensity (Supplementary Fig. 4g, h). 

Figure 5a, b shows the V; dependence of AR/Ro for fixed V,, for v=1/2 
and v=1, respectively. In Fig. 5a, we find an abrupt shift of exciton energy 
together with complete oscillator strength transfer between AP,,, and 
AP, or, demonstrating that the electrons are completely transferred 
from one layer to the other. Figure 5c, e shows the extracted X,,, and 
Xtop energies around v= 1/2. Remarkably, the abruptjump in excitonic 
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Fig. 4| Gate dependence of intralayer exciton resonance inthe 
low-electron-density regime. a, Gate dependence map of differential 
reflectance around the top intralayer exciton resonance (FE =1.6320 eV). The 
inset ofashows the differential reflectance spectrum at (V;, V,,) = (-1V, 0.175 V) 
(indicated with the white star in the map). b, Gate dependence map of the top 
intralayer exciton resonance energy differentiated by V,,. 
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resonance is pronounced at v= 1/2, and smeared out for both lower 
(v=0.35) and higher filling factors (v=0.65). These measurements show 
that abrupt transfer of practically all of the roughly 1,500 electrons 
within the region we monitor optically is linked to the emergence of 
an interaction-induced incompressible state in the lowest moiré sub- 
band at v=1/2 filling. As the filling factor is increased or decreased 
towards v= 1/2, the electronic system shows an ever-stronger layer 
pseudospin paramagnetism, due to the enhanced role of interactions, 
but otherwise exhibits a continuous interlayer transfer of electrons asa 
function of E, that would be expected from acompressible state. Close 
to v=1/2, there is a phase transition to an incompressible state that 
can be accommodated in either the top or the bottom layer (Fig. 5g). 
Remarkably, interlayer charge transfer takes place upon changing V 
by only 1.9 mV: the corresponding change in the single-particle energy 
detuning between the top and bottom layers (26 eV) is much smaller 
than k,T (360 peV; where k, is the Boltzmann constant and Tis the tem- 
perature) (Supplementary Information section 5). 

Figure 5b shows that for v = 1, AR/R, is characterized by three 
plateau-like regions. We attribute the abrupt jumps in the excitonic 
resonance energy to the transition from (V,o,, Voor) = (0, 1), through 
(1/2, 1/2), to (1, O) configurations (Fig. 5h). This explanation is con- 
firmed by the corresponding changes in the oscillator strength of the 
AP resonances of the top and bottom layers. In the (1, 0) and (0, 1) con- 
figurations, we measure a reflectance signal either from AP,., OF APyot, 
consistent with full-layer polarization. In the (1/2, 1/2) configuration, 
we find the oscillator strength of AP,,, and AP,,; to be identical and 
equal to half the value obtained under (1, 0) for AP,,.,. The extracted 
excitonic resonance energy for X,,, and X,,, around v= 1 is shown in 
Fig. 5d, f, respectively. The plateau structure of the (1/2, 1/2) state with 
abrupt changes of electron density difference between the layers at 
V-=—1.3 Vand V,=—0.5 Vis clearly visible at v=1, but is smeared out for 
both lower (v=0.85) and higher fillings (v=1.15). The emergence of the 
stabilized (1/2, 1/2) plateau at v=1 strongly suggests that there is mutual 
stabilization of the incompressible electronic state due to the interlayer 
electron-electron interactions. The energy gap of the incompressible 
(1/2, 1/2) state is estimated to be 5.5 meV (Supplementary Information 
section 6). The reflectance data for higher fillings (v=3/2 and v=2) are 


Fig. 5 | Electric field dependence of 
differential reflectancespectrum 
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shown in the Supplementary Information (Supplementary Fig. 5): in 
stark contrast to the (1/2, 1/2) configuration at v = 1, a plateau at the 
(1, 1) electron configuration is missing at v=2 filling, indicating that the 
state with the corresponding integer fillings is not sufficiently stabilized 
by the interlayer interactions. 

Finally, we also perform the measurement under a perpendicular 
magnetic field B,=7 T (Supplementary Information). In Supplementary 
Fig. 6, we find that the plateau structure observed for v= 2 under full 
valley polarization of electrons is identical to that observed for B,=OT, 
although the total number of electronic states per moiré subband is 
halved dueto giant valley-spin susceptibility of electrons in MoSe, (ref.”*). 
This observation shows that the incompressibility is determined 
by filling of each moiré site by a single electron, irrespective of its 
(valley) degeneracy. The observation supports that our identification of 
n=2x10"cm”, yielding half-filling of a single-layer moiré subband. We 
also find that the reflectance spectrum does not show any valley polari- 
zation at B,=7 T for the v< 1/2 state. However, the resilience against 
valley polarization is not sufficient to claim antiferromagnetic order. 


Discussion 


The experiments we describe in the ‘Interaction-induced incompress- 
ible states’ section demonstrate the existence of Mott-like incompress- 
ible electronic states for half-filling of the lowest moiré subband. Unlike 
previous reports” ”°, our experiments are carried out for along moiré 
superlattice lattice constant of Apoirs = 24 nm and an r, parameter of 
r,2 14. The weakness of the moiré potential stemming from the hBN 
layer separating the two MoSe, layers in turn ensures that the on-site 
interaction strength is larger than the depth of the moiré potential. 
In this sense, the homobilayer system realizes a rather unique regime 
that goes beyond the standard Fermi-Hubbard model. 

In addition to establishing twisted TMD homobilayers as a promising 
system for investigating Mott-Wigner physics? * originating from 
strong electronic correlations, our experiments open up new avenues 
for exploring interactions between dipolar excitons and electrons 
confined to flat bands. In particular, the structure we analysed could be 
used to realize and study Bose-Fermi mixtures consisting of degenerate 
electrons strongly interacting with an exciton condensate generated 
by resonant laser excitation. The phase diagram of such a mixture is 
not fully understood”, but is expected to provide a rich playground 
for many-body physics”. 
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Methods 


Device fabrication 

The device structure is shown in Extended Data Fig. 1. All MoSe,, gra- 
phene and hBN flakes were obtained by mechanical exfoliation of bulk 
crystals. The flakes were assembled together using the dry transfer 
technique” in an argon-filled glove box. The crystal axis of top and 
bottom MoSe, layers were aligned to be close to 0° using atear-and-stack 
technique®. We expect to have the effect of moiré potential froma 
small twist angle of MoSe, layers, but not between MoSe, and hBN 
layers due to the large lattice constant mismatch (ayjgs¢, = 3. 32A 
(ref. 72) and Gypy = 2.51 A (ref. °°). The metal electrodes to graphene 
layers are formed by Ti/Au (5 nm/145 nm). The contact to the bottom 
graphene gate is formed by Cr/Au (3 nm/147 nm) using the 1D contact 
technique™ by etching the hBN layer with reactive ion etching using 
CHF./O, as mixture gas. 


Optical spectroscopy 

The PL measurements were performed using a HeNe laser (633 nm) as 
an excitation source. The reflectance measurements were performed 
using a single-mode fibre-coupled broadband light-emitting diode 
with acentre wavelength of 760 nm and a bandwidth of 20 nm. In both 
PL and reflectance measurements, we used a long working distance 
apochromatic objective lens with a numerical aperture of 0.65 (atto- 
cube LT-APO/LWD/VISIR/0.65). All optical spectroscopy measurements 
were performed at cryogenic temperature (7~4 K). All data inthe main 
text were obtained at zero magnetic field, B, = 0 T. 


Capacitance parameters 

The thickness of the top and bottom hBN layers was estimated to be 
63.3nm and 52.7 nm, respectively, from the white light reflectance spec- 
trum. We used 3.7 as the dielectric constant for hBN®”. We expect the 
uncertainty to be about 10% for both hBN thickness and the dielectric con- 
stant, which gives around 14% error in the calculation of carrier density. 


Estimation of r, parameter and moiré periodicity from electron 
density 

The r, parameter (Wigner-Seitz radius divided by the effective Bohr 
radius) that describes the ratio between kinetic energy to Coulomb energy 
is given by r,=1/aj./Tn (ref. °), Here a3 = 411eegh?/mie?=0.91nm is 
the effective Bohr radius in an encapsulated MoSe, monolayer, mz; is 
the effective electron mass, ¢ = (€), Mose, * &, npn)/2 is the lattice dielec- 
tric constant”, €, is the vacuum permittivity, hf is the reduced Planck 
constant and eis the elementary charge. For the calculation we used 
m*= 0.7m, (ref.*8), where m, is the bare electron mass and the in-plane 
dielectric constants €) ose, = 17.1and &) pan = 6.93 (ref. ”). For an elec- 
tron density of n=2 x10"cm™(v=1/2), we obtain r,~14, which is reduced 
tor,~7 at density of n=8 x10" cm (v=2).We emphasize that the above 
values for the r, parameter are underestimated. At these low densities, 
the interelectron separation, fo = 1/./ntt > 10 nm, is much larger than 
the MoSe, layer thickness dyjose, ~ 0.7 nm. In this limit”, the screening 
of the interactions is dominated by the hBN dielectric; therefore, a 
better approximation is to take the dielectric constant to be € = € npn, 
which results in r,~ 24 for anelectron density of n=2 10" cm? (v=1/2) 
and r,=12 forn=8 x10" cm? (v=2). 

The moiré superlattice lattice constant A,,,i;6 is estimated from the 
following relation by assuming a triangular moiré superlattice: 
Amoiré = (13 /2)A2 cine =V/Nmoire » Where A moire iS the moiré superlattice 
unit cell area and n,,,ir6 is the electron density that corresponds to 
one electron occupation per moiré superlattice unit cell. We used 
Nmoire = 2 X 10" cm, which is obtained from the periodicity of the 
differential reflectance modulation as we discussed in the main text, 
which gives A,,oi6* 24 nm. The twist angle 6 between MoSe, layers is then 
estimated to be 6 = 0.8° using the following relation: A, oir¢ = Amose,/8, 
where dyjose, = 3-32 Ais the lattice constant of MoSe, (ref. ”). 


Verification of moiré periodicity estimation using interlayer 
exciton fine-structure analysis 
Here we estimate the moiré periodicity from the fine-structure energy 
scale of the interlayer exciton. In the limit when the moiré potential is 
weak compared with the kinetic energy scale, umklapp scattering by the 
moiré potential is the dominant origin of the fine structure of the inter- 
layer exciton. In this limit, the energy difference of the two lowest energy 
interlayer excitons is approximately described by the kinetic energy of 
the interlayer exciton at the momentum, ,,,,= 41/(/3 Amoire), Where 
A noire iS MOiré periodicity’™. Using the energy splitting of the first- and 
second-lowest energy interlayer exciton AF, 5, Amoirs is obtained 
using 
2.2 
AE, FA moire (1) 
; 2M 


Go ong ON 
moiré (OMAE, > (2) 


where the total mass of excitonis M = mz + m;=1.3m, mis the effective 
electron mass, miis the effective hole mass (m* = 0.7m, from ref. 8 and 
m= 0.6m, from ref.”!) and his the Planck constant. From Fig. 2c in the 
main text, the energy splitting is AF, ,~ 4.4 meV, which corresponds to 
Anoirs ~ 19 nm. At a different spot, from Supplementary Fig. 3d, the 
energy splitting is AF, ,~3.7 meV, which corresponds to A noire 20 nM. 
Both values are qualitatively similar to what we obtained from the den- 
sity periodicity of the reflectance signal (A,,i¢% 24 nm from Fig. 4b and 
Amore 25 nm from Supplementary Fig. 10). We note that the third-lowest 
energy interlayer exciton line observed in reflectance is not captured 
well by this simple model. In reality, the energy spacing of the interlayer 
excitons depends on the magnitude of moiré potential, and the analy- 
sis we present here could only yield a rather rough estimate of the moiré 
periodicity. 


Effect of strain on moiré periodicity 

A moiré superlattice emerges from a twist angle, lattice constant dif- 
ference or acombination of the two. Inthe main text, we attributed our 
superlattice to be originating from this twist angle. Here we discuss how 
much strain-induced lattice constant modification changes the moiré 
periodicity. Moiré periodicity for twist angle of 0 and lattice constant 
difference ratio of two layers 6 (ref. *°), which corresponds to strain 
difference of two layers, is expressed as: 


Amose,(1 + 6) 


(21+ 6)(1- cos6) +57 


(3) 


A moire = 


Extended Data Fig. 2a plots the relation between the biaxial strain 
difference (6) and twist angle (6) for fixed A,,,ie using equation (3) 
(Amose, = 3-32 A). Inthe main text, we have shown that there is an intra- 
layer exciton energy difference between the top and bottom layers, 
whichis about 8 meV. By assuming this energy difference to arise from 
the strain difference between the two layers, we extract the amount of 
strain difference to be on the order of about 0.1-0.25%. We base our 
rough estimate on values reported in literature: uniaxial strain MoS;: 
70 meV % 1! (ref. *4), 45 meV % " (ref. 47), 48 meV % ! (ref. *); biaxial strain 
MoSe,:33 meV % 7 (ref. 4). For Ajoirs* 24 nm, the moiré periodicity within 
this strain range is dominated by twist angle and hardly modified by 
strain (only 1.4% reduction of A,,:-< for 0.25% strain assuming fixed twist 
angle of 0.8°, as we show in Extended Data Fig. 2b). However, to intro- 
duce a moiré periodicity of about 24 nm with only strain difference, 
more than 1.3% of strain difference is required, which is unlikely from 
the magnitude of the energy splitting of the intralayer excitons. 
The spectrum at another spot, which we show in Supplementary 
Information section 2, exhibits 5 meV energy splitting of the top- and 


bottom-layer intralayer excitons. From this energy difference, the 
amount of strain difference is estimated to be in the order of about 
0.07-0.15%, which results in only 0.5% of Anois reduction, assuming 
fixed twist angle of 0.8°. 


First-principles calculation of the tunnel coupling and the moiré 
potential 

We perform density functional theory (DFT) calculations using Quan- 
tum ESPRESSO*. We use projector-augmented-wave pseudopotentials 
witha generalized gradient approximation (Perdew-Burke-Ernzerhof 
functional) from PSlibrary 1.0.0 (ref. *°). To reduce computation cost, we 
neglect spin-orbit interaction and used non-relativistic pseudopoten- 
tials. Computations are performed with 48 cores ona high-performance 
computing cluster. 

We take lattice structure parameters for MoSe, from ref. *”, which 
gives a lattice constant Ayjoce, = 3.32 A. The lattice constant of hBN 
given inref. °° is d,py = 2.504 A. We uniformly stretched the hBN lattice 
by-0.56% and set the lattice constant of hBNas aft" = 2.49 Atoform 
a3 x3 MoSe, and 4 x 4hBN commensurate supercell. We take the inter- 
layer distance of MoSe, and hBN from ref. *8, and set the thickness of 
vacuum layer between MoSe, and hBN as dse-npy = 3-36 A. We use a 
plane-wave cutoff energy of 60 Rydberg and a charge density cutoff 
energy of 480 Rydberg. The Brillouin zone is sampled with a9 x9 x1 
k-point grid. We use a slab geometry with a 30-A-thick vacuum layer. 

Extended Data Fig. 3a—c shows the actual R-stacked (0° twist angle) 
MoSe,/hBN/MoSe,; supercell for different lattice displacement con- 
figurations that we use for the calculation. a=h, X, M denote hexagon 
centre, chalcogen and metal site of TMDs, and Rf denotes the specific 
lattice displacement for R-stacking where the a site of top MoSe, is 
aligned with the h site of the bottom MoSe,. Extended Data Fig. 3d-f 
shows the calculated band structure shown with the mini-Brillouin 
zone. We take the vacuum level as the energy reference (0 eV). Owing 
to zone folding, the K and K’ points of MoSe, come to the y point of the 
mini-Brillouin zone (the K point of hBN comes to the x point of the 
mini-Brillouin zone). Therefore, the lowest conduction bands and the 
highest valence bands at the y point are the ones from MoSe,. As the 
interlayer hybridization effect is substantially smaller than the directly 
contacting TMD heterostructure® ”°, these band structures look almost 
thesame except fora slightly visible energy splitting of Rhdisplacement. 
Though the energy modulation is substantially smaller, the calculation 
shows about 0.5 meV modulation for the lowest conduction band and 
5meV modulation for the highest valence band dependent on the lat- 
tice displacement (Extended Data Table 1). In addition, the calculation 
shows 11-meV tunnel splitting of the valence band edge at Ri displace- 
ment, which is qualitatively in good agreement with experimentally 
observed avoided crossings of intralayer and interlayer excitons medi- 
ated by hole tunnelling. We emphasize that these results are rather 
qualitative, and in reality, it depends on many factors suchas interlayer 
distance, outer dielectric environment and so on. 


Data availability 


The data that support the findings of this study are available inthe ETH 
Research Collection (http://hdl.handle.net/20.500.11850/399579). 


34. Wang, L. et al. One-dimensional electrical contact to a two-dimensional material. Science 
342, 614-617 (2013). 
35. Kim, K. et al. van der Waals heterostructures with high accuracy rotational alignment. 
Nano Lett. 16, 1989-1995 (2016). 
36. Catellani, A., Posternak, M., Baldereschi, A. & Freeman, A. J. Bulk and surface electronic 
structure of hexagonal boron nitride. Phys. Rev. B 36, 6105-6111 (1987). 
37. Laturia, A., Van de Put, M. L. & Vandenberghe, W. G. Dielectric properties of hexagonal 
boron nitride and transition metal dichalcogenides: from monolayer to bulk. npj 2D 
Mater. Appl. 2, 6 (2018). 
38. Larentis, S. et al. Large effective mass and interaction-enhanced Zeeman splitting of 
-valley electrons in MoSe,. Phys. Rev. B 97, 201407 (2018). 
39. Rytova, N.S. The screened potential of a point charge in a thin film. Moscow Univ. Phys. 
Bull. 22, 18-21 (1967). 
40. Lu, C.-P., Li, G., Watanabe, K., Taniguchi, T. & Andrei, E. Y. MoS,: choice substrate for 
accessing and tuning the electronic properties of graphene. Phys. Rev. Lett. 113, 156804 
(2014). 
41. He, K., Poole, C., Mak, K. F. & Shan, J. Experimental demonstration of continuous electronic 
structure tuning via strain in atomically thin MoS,. Nano Lett. 13, 2931-2936 (2013). 
42. Conley, H. J. et al. Bandgap engineering of strained monolayer and bilayer MoS,. 
Nano Lett. 13, 3626-3630 (2013). 
43. Zhu, C.R. etal. Strain tuning of optical emission energy and polarization in monolayer 
and bilayer MoS,. Phys. Rev. B 88, 121301 (2013). 
44. Frisenda, R. et al. Biaxial strain tuning of the optical properties of single-layer transition 
metal dichalcogenides. npj 2D Mater. Appl. 1, 10 (2017). 
45. Giannozzi, P. et al. QUANTUM ESPRESSO: a modular and open-source software project 
‘or quantum simulations of materials. J. Phys. Condens. Matter 21, 395502 (2009). 
46. Dal Corso, A. Pseudopotentials periodic table: from H to Pu. Comput. Mater. Sci. 95, 
337-350 (2014). 
47. Rasmussen, F. A. & Thygesen, K. S. Computational 2D materials database: electronic 
structure of transition-metal dichalcogenides and oxides. J. Phys. Chem. C 119, 
13169-13183 (2015). 
48. Zollner, K., Faria, P. E. Jr & Fabian, J. Proximity exchange effects in MoSe, and WSe, 
heterostructures with Crl,: twist angle, layer, and gate dependence. Phys. Rev. B 100, 
085128 (2019). 


Acknowledgements We acknowledge discussions with E. Demler, R. Schmidt, T. Smolenski, 
A. Popert and P. Knuppel. This work was supported by the Swiss National Science Foundation 
(SNSF) under grant number 200021-178909/1 and the European Research Council (ERC) 
Advanced Investigator Grant (POLTDES). Y.S. acknowledges support from the Japan Society 
for the Promotion of Science (JSPS) overseas research fellowships. K.W. and TT. acknowledge 
support from the Elemental Strategy Initiative conducted by MEXT, Japan, A3 Foresight by 
JSPS and CREST (grant number JPMJCR15F3) and JST. 


Author contributions Y.S. and |.S. carried out the measurements. Y.S. designed and fabricated 
the sample. M.K. helped to prepare the experimental setup. K.W. and TT. grew the hBN crystal. 
Y.S. performed DFT calculation. Y.S., I.S. and A.I. wrote the manuscript. A.I. supervised the 
project. 


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at https://doi.org/10.1038/s41586-020- 
2191-2. 

Correspondence and requests for materials should be addressed to Y.S. or A.. 

Peer review information Nature thanks Wang Yao and the other, anonymous, reviewer(s) for 
their contribution to the peer review of this work. 

Reprints and permissions information is available at http://www.nature.com/reprints. 


Article 


MoSe, (top) 


Extended Data Fig. 1| Optical microscope image of the device. The border of 
each flake is highlighted with dashed lines, and the material is indicated inthe 
grey box with the corresponding colour. Gr, graphene. 


f.3) 
io” 


24 

0.8 > 
= E 22 
a = 
o 
20.6 220 
v ic) 
2 8 
Oo p= 
2 0.4 9 18 
3 ‘2 
E = 

0.2 = 16 

0.0 14 0.25% 

0.00 0.25 0.50 0.75 1.00 1.25 1.50 1.75 0.0 0.5 1.0 1.5. 2.0 
Strain difference (%) Strain difference (%) 


Extended Data Fig. 2 | Effect of twist angle and strain on moiré periodicity. a, Plot of the relation between twist angle and strain difference which gives same 
moiré periodicity (A,,i,¢), Shown for Ajoire from 20 to 30 nm. b, Strain difference dependence of moiré periodicity for a fixed twist angle of 0.8°. 
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Extended Data Fig. 3 | Band structure of R-stacked MoSe,/hBN/MoSe, 
heterostructure obtained from DFT calculation. a—c, The side and top view 
of the supercell for Rh (a), Rx (b) and RV (c) used for the calculation. d-f, The 


calculated band structure of R-stacked MoSe,/hBN/MoSe, for Rh (d), Rx (e) and 
R™ (f) lattice displacement. The insets show the magnified plot of the valence 
bands around the y point. 


Extended Data Table 1| DFT calculation results for MoSe,/hBN/MoSe, 


Lattice displacement | Re | Ri | Ri 


Lowest conduction band edge (eV) | -3.8389 | -3.8383 | -3.8383 


Highest valence band edge (eV) | -5.2835 | -5.2889 | -5.2890 
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Ultrathin ferroelectric materials could potentially enable low-power perovskite 
ferroelectric tetragonality logic and nonvolatile memories”. As ferroelectric 
materials are made thinner, however, the ferroelectricity is usually suppressed. Size 
effects in ferroelectrics have been thoroughly investigated in perovskite oxides—the 
archetypal ferroelectric system’. Perovskites, however, have so far proved unsuitable 
for thickness scaling and integration with modern semiconductor processes’. 

Here we report ferroelectricity in ultrathin doped hafnium oxide (HfO,), a 
fluorite-structure oxide grown by atomic layer deposition on silicon. We demonstrate 
the persistence of inversion symmetry breaking and spontaneous, switchable 
polarization down toa thickness of one nanometre. Our results indicate not only the 
absence of a ferroelectric critical thickness but also enhanced polar distortions as film 
thickness is reduced, unlike in perovskite ferroelectrics. This approach to enhancing 
ferroelectricity in ultrathin layers could provide a route towards polarization-driven 
memories and ferroelectric-based advanced transistors. This work shifts the search 
for the fundamental limits of ferroelectricity to simpler transition-metal oxide 
systems—that is, from perovskite-derived complex oxides to fluorite-structure binary 
oxides—in which ‘reverse’ size effects counterintuitively stabilize polar symmetry in 


the ultrathin regime. 


Ferroelectric materials exhibit stable states of collectively ordered 
electrical dipoles whose polarization can be reversed under an applied 
electric field®. Consequently, ultrathin ferroelectrics are of great tech- 
nological interest for high-density electronics, particularly field-effect 
transistors and non-volatile memories”. However, ferroelectricity is 
typically suppressed at the scale of a few nanometres in the ubiquitous 
perovskite oxides°. First-principles calculations predict six unit cells to 
be the critical thickness in perovskite ferroelectrics’ owing to incom- 
plete screening of depolarization fields®. Atomic-scale ferroelectricity 
in perovskites often fails to demonstrate polarization switching”, 
whichis crucial for applications. Furthermore, attempts to synthesize 
ferroelectric perovskite films on silicon’”” are impeded by chemical 
incompatibility*” and the high temperatures required for epitaxial 
growth. Since the discovery of ferroelectricity in HfO,-based thin films 
in 2011”, fluorite-structure binary oxides (fluorites) have attracted 
considerable interest’ because they enable low-temperature synthesis 
and conformal growth in three-dimensional structures on silicon», 


thereby overcoming many of the issues that restrict its perovskite 
counterparts in terms of complementary metal-oxide-semiconductor 
(CMOS) compatibility and thickness scaling’®. Considering the exten- 
sive implications for future computing”””"’, achieving ferroelectricity in 
sub-2-nm-thick doped-HfO, is highly desirable for realizing ultra-scaled 
CMOS-compatible ferroelectric-based devices beyond the 5 nm tech- 
nology node”. 

Here we demonstrate ferroelectricity in ultrathin (1 nm thick) 
Hfo.sZ1'o.202(HZO), grown by low-temperature atomic layer deposition 
(ALD) on silicon. Second harmonic generation and advanced scan- 
ning probe techniques establish the presence of inversion symmetry 
breaking and switchable electric polarization, respectively. Not only 
is ferroelectricity stabilized in ultrathin HZO, but spectroscopic and 
diffraction signatures of its fluorite-structure symmetry also indicate 
enhanced polar distortion in the ultrathin regime. Such size effects in 
this fluorite-structure system do not occur in its perovskite counter- 
parts®, which can be understood from symmetry considerations. In 
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Fig. 1| Size effects in fluorite-structure ferroelectrics. a, In 
fluorite-structure ferroelectrics, the polar distortion presentinthe 
orthorhombic phase can be represented as the centre anion displacement 
(cyan) with respect to its surrounding cation tetrahedron (represented by cyan 
arrow); inthe nonpolar tetragonal phase, the oxygen atom (blue) lies inthe 
polyhedral centre of the tetrahedron. The evolution of the bulk-stable 
monoclinic polymorph tothe high-symmetry tetragonal and polar 
orthorhombic phases in the fluorite-structure structure illustrates the role of 
size effects — surface energies favour higher symmetry - and confinement 
strain — distortions favour lower symmetry — onstabilizing inversion 
asymmetry. In fluorite-structures, the noncentrosymmetric O-phase is higher 


classical perovskites, surface-energy-driven size effects at reduced 
dimensions favour the high-symmetry paraelectric phase (cubic) over 
the low-symmetry ferroelectric phase (tetragonal)*°. Conversely, in 
fluorites, the noncentrosymmetric phase (orthorhombic Pca2,, 
O-phase) is higher-symmetry relative to the bulk-stable centrosym- 
metric phase (P2,/c, M-phase)”. Consequently, surface energies can 
promote (destabilize) inversion symmetry breaking in fluorite (per- 
ovskite) ferroelectrics in the two-dimensional limit”’**. Owing to this 
size-induced noncentrosymmetry—thatis, ‘reverse’ size effects (Fig. la, 
Extended Data Fig. 1)—HZO presents a promising model system in which 
to explore the ultrathin limits of ferroelectricity. 

Thin films of HZO are grown using ALD downto ten cycles on oxidized 
silicon at 250 °C. For reference, approximately 11 ALD cycles correspond 
to 1nm thickness, as confirmed by X-ray reflectivity (XRR, Extended 
Data Fig. 2) and transmission electron microscopy (TEM, Extended Data 
Fig. 3). Subsequently, HZO films are capped by a metal and subjected 
to rapid thermal annealing—henceforth referred to as confinement 
strain (Fig. 1c)—to impart anisotropic thermal stresses. Confinement 
strain has been reported to help distort the high-symmetry tetrago- 
nal fluorite-structure polymorph (P4,/nmc, T-phase) into the polar 
O-phase”® (Fig. 1a). For confined ultrathin HZO films, X-ray diffraction 
analysis indicates a highly oriented noncentrosymmetric structure in 
contrast to polycrystalline thicker films (Extended Data Fig. 4). Inver- 
sionsymmetry breaking in 10 cycle (about 1nm) HZO films is confirmed 
by the presence of second harmonic generation (SHG) (Extended 
Data Fig. 5), previously employed to demonstrate ferroelectricity in 
a two-dimensional material”. 

Beyond inversion asymmetry, ferroelectricity also requires electrical 
switching between polarization states. Resonance-enhanced piezore- 
sponse force microscopy (PFM) demonstrates bistable switching in 
Si/SiO,/HZO heterostructures (Fig. 2a). PFM phase images on ten-cycle 
HZO films (Fig. 2d) show well defined regions of 180° phase contrast— 
corresponding to remanent polarization states (Fig. 2b)—that can be 
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symmetry relative to the bulk-stable centrosymmetric M-phase. Consequently, 
both intrinsic (surface energy) and extrinsic (confinement strain) mechanisms 
can favour ultrathin inversion symmetry breaking in fluorite-structures, in 
stark contrast to size effect trends perovskites (Extended Data Fig. 1).b, 
Cross-sectional ADF STEM image of 20-cycle (about 1.8 nm) HZO, 
demonstrating ultrathin HZO films on silicon vialow-temperature ALD. The Si 
substrate is oriented along the [110] zone axis. c, Schematic heterostructure 
investigated in this work, detailing the ultrathin ferroelectric HZO layer 
deposited on Si/SiO,, and the capping metal layer employed toimpart 
confinement strain during post-deposition rapid thermal annealing 
(Methods). 


rewritten in a non-volatile fashion. Notably, unpoled regions demon- 
strate the same phase contrast as positively poled regions (Fig. 2d), 
indicating that ultrathin HZO exhibits spontaneous polarization. In 
previous studies, field cycling is often required to ‘wake up’ the ferro- 
electricity in HfO,-based fluorites, whichis attributed to a field-induced 
nonpolar-polar phase transition”. Here, atomic-scale thickness in tan- 
dem with mechanical confinement enhances the polar phase stability 
to exhibit spontaneous polarization, eliminating one of the most criti- 
cal issues plaguing fluorite-structure ferroelectrics'®. Rapid thermal 
annealing alone is insufficient for ferroelectricity; regions of HZO 
annealed without a metal capping layer do not exhibit ferroelectric 
signatures (Extended Data Fig. 6). This highlights the critical role of 
ultrathin confinement, and the strains imposed by such layering, on 
stabilizing the polar O-phase in ultrathin fluorite-structure films. 
Along with phase contrast imaging, local PFM switching spectros- 
copy further confirms the robust ferroelectricity in ten-cycle HZO 
films, as demonstrated by 180° phase hysteresis and its butterfly-shape 
amplitude (d;3) loops (Fig. 2e). PFM spectroscopy was performed on 
metal electrodes to eliminate electrostatic artefacts from the tip” and 
potential electromechanical contributions (Methods). In addition, care- 
ful monitoring of topography was performed during poling (Extended 
Data Fig. 7a) to detect the possibility of any electrochemical and elec- 
tromechanical artefacts. Time-dependent PFM imaging (Extended Data 
Fig. 7b) on ten-cycle HZO illustrates polarization patterns sustained 
for at least 24 h; such long-term retention suggests that the PFM con- 
trast is due to ferroelectric behaviour and not due to shorter-scale 
spurious effects often attributed to amorphous hafnia®. Furthermore, 
alternating voltage (V,.)-dependent piezoresponse loops (Extended 
Data Fig. 7c) rule out electrostatic artefacts from charging”. Moving 
beyond the standard PFM optical beam detection method, interfero- 
metric displacement sensor (IDS) PFM measurements (Extended Data 
Fig. 7d) definitively demonstrate the ferroelectric origin of switching 
spectroscopy hysteresis in ten-cycle HZO. The recently developed IDS 
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Fig. 2 | Electric polarization switching in ultrathin HZO. a, Schematic of the 
Si/SiO, (2nm)/HZO (1nm) heterostructure investigated by scanning probe 
imaging. b, Schematic of the HZO unit cell in the ferroelectric orthorhombic 
structure (Pca2,). The different-coloured oxygen atoms represent the 
displaced oxygen atoms (cyan) and the centrosymmetric oxygen atoms (blue) 
within the surrounding cation tetrahedron. The blue arrows labelled Pdenote 
the polarization directions corresponding to the acentric oxygen atomic 
displacements. c, Microwave-frequency SCM spectroscopy for aten-cycle HZO 
film. The presence of butterfly-shaped C-V conclusively demonstrates 
ferroelectricity in ultrathin HZO, enabled by the high-frequency detection of 
differential capacitance (Methods). Microwave dC/dV measurements on 
multiple regions of ten-cycle HZO demonstrate the robust ferroelectric 
behaviour (Extended Data Fig. 8). d, Phase-contrast PFM images 
demonstrating stable, bipolar, remanent polarization states that can be 


technique” eliminates the long-range electrostatics and cantilever 
resonance artefacts that obfuscate typical voltage-modulated PFM. 
Contact IDS measurements demonstrate 180° phase hysteresis and 
butterfly-shaped d,;, which is indicative of ferroelectric behaviour 
free of electrostatic contributions”*. A lack of electrostatically driven 
hysteresis is confirmed by off-surface measurements (Extended Data 
Fig. 7d). Along with IDS, scanning capacitance microscopy (SCM) 
provides another advanced scanning probe technique with which 
to probe ferroelectricity in ultrathin HZO. SCM differential capaci- 
tance spectroscopy on ten-cycle HZO demonstrates butterfly-shaped 
capacitance-voltage (C-V) hysteresis (Fig. 2c, Extended Data Fig. 8). 
The microwave-frequency detection in SCM (Methods) mitigates the 
leakage currents that prevent typical bulk electrical characterization 
of ultrathin ferroelectrics, and provides conclusive evidence of fer- 
roelectric polarization switching. 

To examine the structural and electronic origins of ferroelectricity 
in HZO, we employ grazing-incidence X-ray diffraction (GI-XRD) and 
X-ray absorption spectroscopy (XAS) (Fig. 3). GI-XRD alone cannot 
unambiguously distinguish between certain fluorite-structure poly- 
morphs in ultrathin HZO; as acomplementto diffraction, XAS provides 
spectroscopic signatures of the polar O-phase and nonpolar T-phase. 
In particular, the T-phase nonpolar distortion (D,,, fourfold prismatic 
symmetry) from regular tetrahedral to fluorite-structure symmetry 
does not split the degenerate e orbitals (d,.2_,2, d3,2_,2). Meanwhile, 
the O-phase polar rhombic pyramidal distortion (C,,, twofold pyram- 
idal symmetry) does split the e manifold, providing asymmetry-specific 
spectroscopic marker (Extended Data Fig. 9a,b). Owing to the da” elec- 
tronic configuration present in Hf* (Zr*’), spectral weight from oxygen 
K-edge XAS can be attributed solely to crystal field effects, providing 
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overwritten into the opposite polarization state for aten-cycle HZO film. We 
note that the unpoled outer perimeter matches phase contrast with the 
positively poled regime regardless of the poling-polarity sequence; this 
indicates that ultrathin HZO exhibits spontaneous polarization without 
requiring ‘wake-up’ effects to become ferroelectric. Time-dependent PFM 
imaging further demonstrates the robust ferroelectric contrast (Extended 
Data Fig. 7).e, Phase and amplitude switching spectroscopy loops fora 
ten-cycle HZO film, demonstrating ferroelectric-like hysteresis. 
Interferometry-based IDS PFM hysteresis loops confirm that the origin of 
switching spectroscopy hysteresis is free of artefacts (Extended Data Fig. 7) 
and switching-spectroscopy measurements demonstrating the critical role of 
confinement during phase annealing for stabilizing the polar phase in ultrathin 
HZO (Extended Data Fig. 6). 


additional insights into the degree of structural distortion. The spec- 
tral weight of the symmetry-split e regimes (Extended Data Fig. 9c) and 
pre-edge regime (Extended Data Fig. 9e) increases with decreasing 
thickness, indicative of more pronounced rhombic distortion and 
divergence from isotropic nearest-neighbour oxygen polyhedral coor- 
dination (Extended Data Fig. 9d), respectively. These spectral weight 
trends provide further evidence of ultrathin-enhanced distortions. In 
conjunction with XAS, X-ray linear dichroism (XLD) can also probe 
structural distortions owing to its sensitivity to orbital asymmetry. 
Nanospectroscopy via soft X-ray photoemission electron microscopy 
(PEEM) illustrates spatially resolved XLD contrast at 535 eV (Extended 
Data Fig. 9f), corresponding to the e-split rhombic distortion regime. 
This suggests that XLD at the O K edge is indeed sensitive to polar fea- 
tures in ultrathin HZO. Shifting to sample-averaged XLD at the 
Zr M, edge, the orbital polarization is found to increase from the thick 
(100-cycle) to ultrathin (ten-cycle) regime (Fig. 3c), indicative of 
increased oxygen polyhedral distortion (Fig. 3d) consistent with 
ultrathin-enhanced ferroelectricity. 

Remarkably, we also observe the emergence of crystallographic 
texturing of HZO films in the ultrathin regime (Fig. 1b, 3e). We note that 
many of the reflections in 100-cycle HZO, including the dominant (111), 
are absent in the GI-XRD spectra below 25 cycles owing to the geomet- 
ric limitations of one-dimensional spectra (unable to detect all the 
reflections present in highly oriented films) (Methods). Tilted-geometry 
(g-x) diffraction (that is, pole figures) are required to access these 
oriented reflections at specific points, rather than polycrystalline-like 
rings, in reciprocal space. The spot-like patterns present in pole figures 
about the (111) reflections (Extended Data Fig. 4b) confirm the high 
degree of texturing in ultrathin HZO. Interestingly, this texturing 
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Fig. 3 | Emergence of ‘reverse’ size effects in ultrathin HZO. a, 
Thickness-dependent XAS at the O K edge; spectral weight XAS trends indicate 
enhanced polyhedral disorder and tetrahedral and rhombic distortions in 
ultrathin films, illustrated by the crystal field splitting diagram for the 
fluorite-structure structural polymorphs and symmetry-specific XAS 
simulations (Extended Data Fig. 9a, b). The eand ¢, d-electron energy manifolds 
are set by the fluorite-structure tetrahedral symmetry (Extended Data Fig. 9a). 
b, Thickness-dependent XLD at the OK edge; the orbital polarization inversion 
below 25 cycles corresponds to the onset of highly oriented ultrathin films. 

c, Thickness-dependent orbital polarization and XLD (inset) at the Zr M, edge. 
The orbital polarization trend indicates ultrathin-enhanced ZrO, tetrahedral 
distortion, schematically represented in d by acentric oxygen atomic 


happens despite local nanocrystalline regions observed in TEM for 
ultrathin films (for example, 15-cycle HZO in Extended Data Fig. 3d). 
Coinciding with the onset of texturing, the microstructural evolution 
below 25-cycle HZO manifests spectroscopically as inverted orbital 
polarization at the e manifold (Fig. 3b), suggesting flipped 
polar-distortion-split e levels (d,.2_,2and d3,2_,2). This indicates that 
sub-25-cycle ultrathin films enter a new electronic structure concur- 
rently as the crystalline structure orders. Therefore, confinement strain 
in atomic-scale fluorite films could provide a route to tailor electronic 
structure and engineer polarization at the orbital level”’, akin to 
epitaxial strain in perovskite films. 

The onset of highly ordered films also coincides with sharp rises 
in structural markers of distortion (Fig. 3g). The degree of rhombic 
distortion is captured by the lattice spacing d,,,; accordingly, d,,, is 
tied to macroscopic polarization in HZO”*”’. We observe increasing 
d,, with decreasing thickness (Fig. 3g), as previously reported in epi- 
taxial HZO films grown by pulsed laser deposition”, consistent with 
ultrathin-enhanced ferroelectricity. Notably, our low-temperature 
ALD-grown films on silicon can induce similar structurally induced phe- 
nomena observed in high-temperature pulsed-laser-deposition-grown 
epitaxial films on perovskite templates and extended to an even thin- 
ner limit. Furthermore, the d,,, bifurcation below 25 cycles suggests 
a link between texturing and amplified distortion in the ultrathin 


displacement (cyan atoms). e, Synchrotron GI-XRD demonstrating the 
emergence of highly oriented ultrathin films, consistent with ADF-STEM 

(Fig. 1b) and pole figure analysis of ultrathin HZO (Extended Data Fig. 4b). 

f, Thickness-dependent GI-XRD around the polar orthorhombic (111) 
reflection, demonstrating a systematic shift in 26,,, with thickness, and 
highlighting the limitation of GI-XRD geometry to detect the (111)-reflection 
below 25 cycles as the film becomes highly oriented (Extended Data Fig. 4). 

g, Thickness-dependent d,,, lattice spacing and 2c/(a+ b) structural aspect 
ratio, suggesting amplified polarization in the ultrathin limit, especially below 
25 cycles. Dashed lines denote reported d,,, and aspect ratio values for thicker 
ferroelectric HZO films. Aspect ratio values are extracted fromthe 
symmetry-split {200} planes (Methods). 


regime. Another crystallographic signature, orthorhombic aspect 
ratio (2c/a + b), also indicates enhanced distortions in the ultrathin 
regime (Fig. 3g). Fluorite-structure orthorhombicity” is akin to perovs- 
kite tetragonality® (c/a); these ratios serve as structural barometers 
of macroscopic polarization. The orthorhombic distortion present 
in ten-cycle HZO far exceeds any reported values for HfO,-ZrO, poly- 
morphs”: we find >10% aspect asymmetry, whereas 3-4% is typically 
reported for fluorite-structure ferroelectrics, consistent with our 
thicker films (Fig. 3g). Correspondingly, the tetrahedral and rhom- 
bic crystal field splitting energies in ultrathin films surpass expected 
polar fluorite-structure values by 1.3 eV and 700 meV, respectively 
(Extended Data Fig. 9g). Such colossal structural splittings are well 
beyond the reported limits of epitaxial strain in perovskite films”. 
Therefore, although prohibitive tunnel currents prevent accurate 
quantification of polarization from traditional polarization-voltage 
measurements, multiple structural gauges of polarization indicate 
substantial enhancement in the ultrathin limit. 

Insummary, several techniques self-consistently demonstrate robust 
ferroelectricity in HZO films of thickness downto 1nm (Extended Data 
Fig. 2,3), synthesized by low-temperature ALD on silicon. Remark- 
ably, these experiments indicate that polar distortions are amplified 
inthe ultrathin limit; diffraction markers (d,,, lattice spacing, structural 
aspect ratio) and spectroscopic signatures (orbital polarization, crystal 
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field splitting) alldemonstrate ultrathin enhancement. Such ‘reverse’ 
size effects oppose conventional perovskite ferroelectric trends®. 
Previous works on polycrystalline doped HfO,*! * have explained 
thickness-dependent polarization trends based on the volume frac- 
tion of the ferroelectric O-phase. However, our observations are more 
consistent with studies of pseudo-epitaxial HZO films grown on perovs- 
kite templates”*” in that we also observe substantial orientationin the 
ultrathin regime and enhancement of ferroelectricity with decreasing 
thickness. Importantly, our work demonstrates that this enhance- 
ment persists down to at least two fluorite-structure unit cell thickness, 
overcoming the deleterious depolarization field effects that would 
otherwise dominate a prototypical perovskite-structure ferroelectricin 
this ultrathin regime’**. Further studies should explore how the current 
understanding of film synthesis, phase competition and polar distor- 
tion in HZO developed for thicker films (>5 nm)* evolves in the ultrathin 
regime (<2 nm). Our results indicate that harnessing confinement 
strain to amplify atomic displacements in ultrathin films provides a 
route towards enhancing electric polarization at the nanoscale beyond 
epitaxial strain®**”, akin to strain gradients in flexoelectricity*®”. From 
atechnological perspective, direct monolithic integration of ultrathin 
doped HfO, on Si/SiO, paves the way for polarization-driven low-power 
memories (Extended Data Fig. 10) and ultra-scaled ferroelectrics-based 
transistors**), 
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Methods 


Sample deposition and preparation 

Thin films of Hf ,Zr,.0, were grown by ALD in a Fiji Ultratech/ 
Cambridge Nanotech tool at 250 °C, in which tetrakis (ethylmeth- 
ylamino) hafnium and tetrakis (ethylmethylamino) zirconium pre- 
cursors are heated to 75 °C and water vapour is used as the oxidant. 
For metal-ferroelectric-insulator-semiconductor structures, heavily 
p-doped Si(100) substrates (10” cm’®) are first oxidized in ambient 
O, during an rapid thermal annealing step at 900 °C for 60 s, forming 
about 2 nm of thermal SiO, on Si. For metal-ferroelectric-metal struc- 
tures, the Si substrate is coated with 30 nm TiN. Subsequently, HZO is 
deposited at 250 °C by ALD; a 4:1 ratio between the HfO, monolayer 
and the ZrO, monolayer sets the 80:20 stoichiometry of the depos- 
ited HZO, in which ten cycles corresponds to 1nm of film. After ALD 
deposition, a top metal (W or TiN) is deposited by sputtering at room 
temperature. Finally, a rapid post-metal annealing at 500 °C (30s, 
ambient N, background) stabilizes the desired polar orthorhombic 
phase. For capacitor structures (scanning probe studies), the top 
electrodes are defined by photolithography and dry etching. For bare 
structures (structural studies), the top metal is removed by chemical 
etching to expose the HZO surface. Further details pertaining to ALD 
growth conditions, post-deposition processing, and so on are outlined 
ina previous work™. All thin film synthesis was performed at Univer- 
sity of California, Berkeley; processing was performed at the Marvell 
Nanofabrication Laboratory at University of California, Berkeley. One 
nanometre of chemically grown SiO, on Si was prepared by the standard 
clean (SC-1) solution (5:1:1 H,O:H,0,:NH,OH at 80 °C for 10 min) after 
the Si wafer was cleaned in Piranha (120 °C for 10 min) to remove organ- 
ics and HF (50:1 H,O:HF at room temperature for 30 s) to remove any 
native oxide. Thinner SiO, was employed to help reduce depolarization 
fields and improve the electric field distribution through the ultrathin 
ferroelectric HZO layer. 


Electron microscopy 

Electron microscopy was performed at the National Center for Electron 
Microscopy facility of the Molecular Foundry at LBNL as well as by 
Nanolab Technologies Inc.,a commercial vendor. At the National Center 
for Electron Microscopy, TEM samples were prepared by mechanical 
polishing on an Allied High Tech Multiprep and subsequently Ar ion 
milled using a Gatan Precision Ion Milling System at shallow angles 
(5° to 3°) with starting energies of 5 keV stepped down toa final cleaning 
energy of 200 eV to reduce ion-induced damage. High-angle annu- 
lar dark-field (HAADF) scanning transmission electron microscopy 
(STEM) images were recorded on TEAM I, an aberration-corrected FEI 
Titan 80-300 operated in STEM mode at 300 kV with a convergence 
semi-angle of 17 mrad, 70 pA probe current, and collection angles 
>40 mrad. The local thicknesses of the respective HZO layers were 
determined from calibration to the Si(110) interplanar lattice spacing 
(Extended Data Fig. 3), consistent with global thicknesses extracted 
using XRR (Extended Data Fig. 2). 


Scanning probe microscopy 

Piezoresponse microscopy and spectroscopy. PFM measurements 
(Extended Data Figs. 6, 7) were performed using acommercial scanning 
probe microscope (Asylum MFP-3D) at the University of California, 
Berkeley. Dual-frequency resonance-tracking PFM* was conducted 
using a conductive Pt/Ir-coated probe tip (NanoSensor PPP-EFM) to 
image written domain structures and measure switching-spectroscopy* 
piezoelectric hysteresis loops. Resonance-enhanced PFM increases 
the signal-to-noise ratio for the detection of out-of-plane electric po- 
larization, which is critical for ultrathin films. Contact was made to 
the bottom TiN electrode or heavily doped Si substrate for grounding 
in PFM studies. All PFM phase-contrast images and hysteresis loops 
shown were performed on ten-cycle (about 1 nm) HZO films unless 


otherwise indicated. PFM imaging was performed with the tip in direct 
contact with the HZO layer. Switching-spectroscopy hysteresis loops 
were measured on capacitor structures to help eliminate electrostatic 
artefacts from the tip®, mitigate possible electromechanical contri- 
butions”, and to yield more confined electric fields. V,.-dependent 
piezoresponse loops (Extended Data Fig. 7c) examined the ferroelectric 
origin of the PFM signal—as opposed to tip bias-induced artefacts”—in 
ultrathin HZO films. The piezoresponse OFF loop collapsed once V,, 
exceeded the coercive voltage*®, as expected for ferroelectric behav- 
iour. Piezoresponse is defined as Acos@, where A and @ are the PFM 
amplitude and phase, respectively**. We note that the non-ideal shape 
of the piezoresponse loops, particularly at higher voltages, are caused 
by non-ferroelectric artefacts from the additional dielectric SiO, layer 
through which most of the voltage is dropped. For all PFM studies, the 
bias was applied to the tip. 


Effective coercive field. The switching voltage from PFM loops exag- 
gerate the coercive field of the ultrathin HZO layer once considering 
the potential distribution across the modified metal-oxide- 
semiconductor structure (oxide bilayer SiO,-HZO). The effective co- 
ercive field of the HZO layer can be determined using a simple 
dielectric-ferroelectric bilayer model, ignoring accumulation and 
depletion regions at the moment just to approximate the coercive field. 
Considering appropriate electrical boundary conditions across the 
oxide interface (€p-Fpr = €rrér), the voltage across the ferroelectric 
layer (V;,) can be expressed in terms of the total voltage given by the 
PFM loop (V,,,): 


-1 
tor € 
Vee=[L+ eVect 
tre €DE 


where the dielectric constant for the oxide layers are taken as €p,=3.9 
(SiO,) and as €,, = 24 (HZO)* and the thicknesses of the oxide layers 
are tp;=2nm andt,,=1nm. These values yield V,; = V,,,/13, so the effec- 
tive coercive field of the ten-cycle (1 nm) ferroelectric HZO layer is 
approximately 2 MV cm, consistent with values of thicker HZO films 
reported in the literature”. 


Interferometry. IDS PFM measurements (Extended Data Fig. 7d) were 
performed using a commercial scanning probe microscope (Asylum 
Cypher) with an integrated quantitative laser Doppler vibrometer at 
Asylum Research (Santa Barbara). This recently developed method”® 
eliminates crosstalk and other artefacts present in voltage-modulated 
piezo-measurements (d,;) by replacing the typical slope-sensitive 
optical beam detector with a displacement sensitive interferometer. 
By positioning the IDS laser directly over the tip, motion of the tip can 
be decoupled from spurious motion of the cantilever body. Motion of 
the cantilever body can be driven by long-range electrostatics and is 
influenced by the transfer function of the cantilever”®. IDS measure- 
ments were performed with a3 N m' Ti/Ir coated cantilever placed 
onthe bare 1-nm HZO surface at drive frequency 250 kHz and average 
force 75 nN. Off-surface loops (tip raised from surface), which measure 
the extrinsic electrostatic contributions”’, were performed by chang- 
ing the trigger value from deflection (force) to the z-sensor read-out. 
The lack of hysteresis from off-surface (non-contact) loops further 
support the finding that the hysteresis observed from on-contact 
IDS measurements are free from electrostatic contributions. Typical 
voltage-modulated PFM measurements often display false ferroelectric 
hysteresis due to long-range cantilever dynamics inherent to detection 
by the optical beam detector, as observed from non-contact hysteresis 
innon-piezoelectric samples”°. 


Microwave capacitance. SCM measurements (Extended Data Fig. 8c) 
were performed using a commercial scanning probe microscope 
(Asylum Cypher) at Asylum Research (Santa Barbara). Differential 
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capacitance (dC/dV) measurements were performed at 1.8-GHz fre- 
quency with a 40 kHz lock-in frequency at 0.5 V,,. Pure Pt cantilevers 
(Rocky Mountain Nanotechnology) are placed on top of bare HZO 
surface (contact mode) on TiN-buffered Si for SCM measurements; 
dC/dV signals were extracted via V,. applied between the SCM tip and 
bottom electrode. Capacitance-voltage loops via SCM have been previ- 
ously used to confirm ferroelectricity in SrBi,Ta,O. (SBT) thin films”. 
The microwave-frequency nature of the measurement lends itself to 
probing ultrathin ferroelectrics, as it mitigates leakage contributions. 
For SCM measurements, the bias was applied to the sample (swept up 
to +8 V), not to the tip as is done for PFM measurements. 


X-ray diffraction 

Structural characterization. Synchrotron GI-XRD (Extended Data Fig. 
4a) was performed at the Sector 33-BM-C beamline of the Advanced 
Photon Source, Argonne National Laboratory. Using synchrotron 
GI-XRD, we investigated the structural evolution from polycrystalline 
bulk-like (100-cycle) HZO down to highly textured ultrathin (<25 cycles) 
HZO inits polar orthorhombic (Pca2,) phase, at grazing angle <@=0.35°. 
The high flux from the synchrotron source (A = 0.775 A) enabled col- 
lection of sufficient diffraction intensity from the few crystallographic 
planes presentin ultrathin HZO samples. High-resolution GI-XRD was 
also performed using a laboratory-based Panalytical X’Pert Pro X-ray 
diffraction system (CuK, radiation, A=1.54056 A) on HZO films thicker 
than2 nmat grazing angle 0 =0.35°. Previous work employed selected 
area electron diffraction*’ and convergent beam electron diffraction” 
to attribute ferroelectricity in HfO,-based films to the polar orthorhom- 
bic (Pca2,) phase. The indexing of ultrathin HZO films performed in this 
work is consistent with the same polar orthorhombic phase determined 
from these previous electron diffraction studies. 


Texture analysis. Pole figures (Extended Data Fig. 4b) were measured 
at Sector 33-BM-C beamline of the Advanced Photon Source, Argonne 
National Laboratory. For fixed Q values—corresponding to the d,, lat- 
tice spacing—the 4-circle Huber diffractometer rotated in-plane (@) 
360° at multiple values of out-of-plane tilt (y). The PILATUS 100K pixel 
area detector collected volumetric reciprocal space data from which 
two-dimensional pole figure slices were plotted for shells of constant 
Q,. The four concentrated reflections for the {111} projection in Q,-Q, 
space indicate highly oriented texture, rather than the diffuse rings 
expected for polycrystalline films. As indicated in the main text (Fig. 3), 
films thinner than 25 cycles display substantial texturing (Extended 
Data Fig. 4); in particular, the (111) reflection, which is dominant for 
thicker films, is diminished in GI-XRD spectra of ultrathin films ow- 
ing to the geometric limitations of a one-dimensional pattern (it is 
unable to detect all reflections present in oriented films). This limi- 
tation necessitates tilted-geometry two-dimensional patterns (pole 
figures) in order to detect all reflections present in highly oriented 
films. Meanwhile, reflections corresponding to polycrystalline films 
exhibit continuous rings in two-dimensional Q,-Q, reciprocal space, so 
any one-dimensional line-cut (GI-XRD spectra) would detect all reflec- 
tions present. The results on ultrathin HZO films are in stark contrast 
to results on thicker films*°” and indicate that, for such ultrathin films, 
crystallization and orientation need to be considered together. We also 
observed that for such thin films, the template for HZO growth needs 
to be atomically smooth. Therefore, the Si/SiO, interface employed in 
this work is critical; the growth surface is expected to have a larger role 
for ultrathin films than for thicker films. 


Thickness confirmation. Synchrotron XRR of ultrathin HZO films 
(Extended Data Fig. 2)—performed at Sector 33-BM-C beamline of 
the Advanced Photon Source, Argonne National Laboratory and at 
Beamline 2-1 of the Stanford Synchrotron Radiation Lightsource, SLAC 
National Accelerator Laboratory—confirmed the thickness of sub- 
2-nm films. Fitting analysis was performed with the Python package 


xrayutilities”. XRR of thicker HZO films (>2 nm) was measured with 
the Panalytical X’Pert Pro system, and thickness fitting was performed 
with Panalytical software. The extracted growth rate of 11 cyclesnm7™ 
is consistent with results from TEM and literature”. 


Structural distortion analysis. For the polar orthorhombic phase 
(Pca2,), we consider the orthorhombic distortion (that is, orthorhom- 
bicity) as the aspect ratio: 2c/(a+ b) to enable easier comparison to the 
tetragonal (P4,/nm) aspect ratio c/a. Fluorite-structure orthorhombic- 
ity is meant to be analogous to the perovskite ferroelectric tetragonal- 
ity (c/a, where cis the polar axis); both aspect ratios serve as a struc- 
tural gauge of the macroscopic polarization because they are indicative 
of the polar distortion present in their respective structures”. Notably, 
the orthorhombic distortion present in HZO is enhancedin the ultrathin 
regime—opposite to the typical tetragonal distortion trend in perovs- 
kite ferroelectrics°—indicative of the ‘reverse’ size effects present in 
fluorite-structure ferroelectrics. For example, the tetragonal aspect 
ratio was shown to decrease with decreasing thickness in ferroelectric 
PbTiO, films™, while the orthorhombic aspect ratio is greatly enhanced 
inthe ultrathin regime in our fluorite-structure HZO films (Fig. 3g). The 
orthorhombic distortion present in ten-cycle (about 1 nm) HZO far 
exceeds any reported values for HfO,-ZrO, polymorphs: we find >11% 
aspect asymmetry, while 3-4% is typically reported for fluorite-structure 
ferroelectrics”, consistent with our thicker films (Fig. 3g). Indeed, a 
strong relationship between this aspect ratio and the remanent po- 
larization value has been experimentally demonstrated in thicker 
doped HfO, films**. Therefore, the colossal orthorhombic aspect ratio 
present in ten-cycle HZO is consistent with ultrathin-enhanced ferro- 
electricity. The orthorhombic aspect ratio is calculated from the posi- 
tion of various diffraction peaks indexed to the Pca2, phase (the 200, 
020 and 002 peaks), using the following crystallographic relations: 
A=2:dy 9, D=2-do29, C= 2°dgg2 Where A499, dpr9, aNd doo are the 200, 020 
and 002 lattice spacings determined via Bragg’s law and the respective 
peak positions. These values are self-consistently checked against the 
111 interplanar lattice spacing ( 1/d7,,=1/a?+1/b’ + 1/c? ) as well as 
against other orientations present in the diffraction spectra. The aspect 
ratio of the polar O-phase exceeds that of the T-phase (c/a) for doped 
Hfo,™. Another structural marker indicates amplified distortions as 
thickness is reduced, namely the interplanar lattice spacing d,,,. The 
origin of the left shift in the O-phase 111 (T-phase 101) reflection (Fig. 3f) 
with decreasing thickness (that is, decreasing ALD cycles) is typically 
attributed to the abovementioned phase transition (nonpolar T-phase 
to polar O-phase); the left shift of the peak in reciprocal space corre- 
sponds to an increase in real-space lattice spacing. Extending this 
analogy to the ultrathin regime in which the polar O-phase is already 
stabilized, the ultrathin enhancement of d,,, (Fig. 3g) indicates a further 
increase in rhombic distortion (structurally represented by d,,,). Recent 
works on epitaxial HZO films grown by high-temperature pulsed laser 
deposition on perovskite substrates also indicate increasing d,,, with 
decreasing thickness”®*”; these works find the electric polarization to 
increase with increasing d,,,. Similarly, we expect alarger polarization 
in our ultrathin films based on the d,,, trend (Fig. 3g); notably, our 
low-temperature ALD-grown highly oriented films are mimicking the 
trends observed in high-temperature pulsed-laser-deposition-grown 
epitaxial films. 


X-ray spectroscopy 

XAS and XLD. X-ray absorption spectroscopy (XAS) and XLD was per- 
formed at the Advanced Light Source beamline 4.0.2. XAS measure- 
ments were taken at the oxygen K edge (520-550 eV) and Zr M, edge 
(345-355 eV). X-rays were incident at 20° off grazing. XLD (XAS) was 
obtained from the difference (average) of horizontal and vertical linear- 
ly polarized X-rays. To eliminate systematic artefacts in the signal that 
drift with time, spectra were captured with the order of polarization 
rotation reversed (suchas horizontal, vertical, vertical and horizontal) 


insuccessive scans. An elliptically polarizing undulator was used to tune 
polarization and photon energy of the synchrotron X-ray source. XAS 
was recorded under total electron yield mode®. 


Simulated XAS and crystal field symmetry. Simulated XAS spectra 
for the various fluorite-structure polymorphs were computed through 
the Materials Project® open-source database for the XAS spectrum”. 
In particular, the following symmetries for HfO, and ZrO, were inves- 
tigated: monoclinic P2,/c (space group 14), orthorhombic Pca2, (space 
group 29), and tetragonal P4,/nmc (space group 137). Comparisons 
between HZO and the undoped fluorite-structure endmembers (in 
particular, qualitative comparison of splitting-induced spectroscopy 
features) are reasonable owing to the extremely low structural dis- 
similarity between the same polymorphs of HfO, and ZrO,, as deter- 
mined by pymatgen™. The T-phase (P4,/nmc) nonpolar distortion (D4, 
fourfold prismatic symmetry) from regular tetrahedral (7, full tetra- 
hedral symmetry) fluorite-structure symmetry does not split the 
degenerate e bands (d,.2_,2, d3,2_,2),as confirmed by experiment” and 
the XAS simulations (Extended Data Fig. 9b). Meanwhile, the O-phase 
(Pca2,) polar rhombic pyramidal distortion (C,,, twofold pyramidal 
symmetry) does split the e-manifold based on crystal field symmetry 
(Extended Data Fig. 9b), providing a spectroscopic means to distinguish 
the T- and O-phases. The eightfold Hf-O (Zr-O) coordination (Extended 
Data Fig. 9d) in the tetragonal phase (D,, point group symmetry) can 
be decomposed into two tetrahedra that are the space inversion twins 
of one another. Therefore, crystal field splitting of the elevels matches 
that ofa single tetrahedron®—thatis, there is no further splitting. Mean- 
while, the sevenfold Hf-O (Zr-O) coordination (Extended Data Fig. 9d) 
inthe orthorhombic phase cannot be decomposed into two tetrahedra; 
the additional rhombic distortion (not present in the T-phase) splits 
the e manifold. The simulated XAS spectra for T- and O-phase ZrO, 
(Extended Data Fig. 9b) supports this picture, because the additional 
spectroscopic feature present between the main e- and ¢,- absorption 
features in the O-phase is presumably caused by this additional 
symmetry-lowering distortion. The XAS spectra of the HZO thickness 
series (Extended Data Fig. 9c) demonstrates tetrahedral and rhombic 
splitting features closely matching the polar O-phase (Pca2,). This 
demonstrates a spectroscopic method for phase identification beyond 
diffraction—ambiguous owing to the nearly identical T- and O-phase 
lattice parameters”—whose signatures are more sensitive to the subtle 
structural distortions present as symmetry is lowered from the T- to 
the O-phase. 


Crystal field splitting. Notably, the crystal field distortions present 
in confined HZO films greatly exceed what is typically observed in 
bulk fluorite-structures and perovskite ferroelectrics (Extended Data 
Fig. 9g); the tetrahedral (rhombic) crystal field A; (Ag) arising fromthe 
T,(C,,) symmetry in ten-cycle HZO films is 1.3 eV (0.7 eV) greater than 
what is expected from fluorite-structure ZrO, in the polar orthorhombic 
phase (Pca2,). The computational XAS for the Pca2, phase already takes 
the polar distortion (A,) into account; so the enhanced A, in ultrathin 
confined films again points to enhanced polar distortions (consist- 
ent with diffraction-based results). Kindred efforts to uncover routes 
towards enhanced nanoscale distortions have been explored in com- 
plex perovskite heterostructures. For example, in nickelate perovskite 
superlattices, enormous A,, crystal field splitting (up to 0.8 eV) has 
been achieved via polar fields resulting from internal charge transfer”; 
>10% epitaxial strain would be required to induced such large ionic 
distortions in that particular system, well beyond the limits of epitaxial 
strain, which can only achieve e, splitting of about 300 meV (ref. °°). 


Spectral weight trends. The relative spectral weight of the e and ¢, 
manifolds (Extended Data Fig. 9c) at the O K edge can also provide 
insight into the degree of structural distortion”. Owing to the d” elec- 
tronic configuration present in Hf* (Zr*), all dstates are available for 


mixing with O 2p states, so the analysis of e-t, spectral weight can be 
simplified to be purely due to crystal field effects. Tetrahedral sym- 
metry lowers e bands relative to t, bands due to the enhanced t, orbital 
overlap with oxygen 2p orbitals. The enhanced t,/e spectral weight as 
thickness is reduced (Extended Data Fig. 9c) indicates the preference for 
O 2p hybridization with Hf 5d (Zr 4d) t, orbitals, further exaggerating the 
disparity set by the tetrahedral symmetry as the symmetry is lowered 
to the polar O-phase. Additionally, the increase in spectral weight of 
the pre-edge shoulder (Extended Data Fig. 9e) provides further confir- 
mation that structural distortions are amplified in the ultrathin limit. 
Pre-edge features at the O K edge in complex transition metal oxides 
are commonly attributed to nearest-neighbour variations from typical 
oxygen polyhedral coordination as the symmetry is lowered by various 
distortions™. Analogously, here the pre-edge feature is attributed to 
variation from eightfold coordination in the T-phase (NN =8) as the sym- 
metry is lowered into the polar O-phase (NN =7) (Extended Data Fig. 9d). 
On the unit cell level in the polar O-phase, the central metal cation is 
surrounded by an asymmetric oxygen coordination environment (note 
the 4 blue and 3 cyan oxygen atoms in Extended Data Fig. 9d) owing to 
the polar rhombic distortion of normal tetrahedral (7,) symmetry; this 
polyhedral distortion can manifest as increased spectral weight at the 
oxygen K pre-edge®. The critical e manifold splitting due to the polar 
rhombic distortion also increases in spectral weight as thickness is 
reduced (Extended Data Fig. 9c). The XAS spectral weight trends mir- 
ror the structural indicators of ultrathin-enhanced distortion (Fig. 3c). 


Orbital polarization. In conjunction with XAS, XLD can also probe 
structural distortions owing to its sensitivity to orbital asymmetry, 
which can arise from inversion symmetry breaking. For example, inthe 
perovskite ferroelectrics PbTiO, and BaTiO,, the Ti 3d to O 2p orbital 
hybridization is essential for stabilizing the noncentrosymmetric struc- 
ture®. Particularly at the 3d cation, edge, orbital polarization extract- 
ed from XLD is used as a measure of the oxygen octahedral distortion 
in perovskites owing to the anisotropic hybridization between cation 
3dand O 2p orbitals“. Accordingly, in fluorite-structure ferroelectrics, 
the magnitude of XLD present at the Zr M;,, edges can bea gauge of the 
degree of polyhedral distortion (in this case, a distortion of the oxygen 
tetrahedron) and the oxygen atomic asymmetry. Indeed, the orbital 
polarization at the Zr M, edge is enhanced as the thickness is reduced 
from the thick (100-cycle) to ultrathin (ten-cycle) regime (Fig. 3c), 
consistent with diffraction-based results demonstrating amplified 
structural distortions in the ultrathin limit. Spectroscopy can also help 
understand the evolution to highly textured films in the ultrathin limit 
(Fig. 3e, f), as XLD enables both element- and orbital-specific infor- 
mation by comparing polarization-dependent XAS spectra. GI-XRD 
across the thickness series (Fig. 3f) indicates that the degree of orien- 
tation substantially changes as the HZO drops below about 2.5 nm (25 
cycles). The microstructure change below 25 cycles also manifests as 
inverted orbital polarization at the oxygen K edge, particularly at the 
emanifold (Fig. 3b). Absorption of vertically and horizontally polarized 
light preferentially probes the polar-distortion-split e levels (the x*y” 
and 3z’r d orbitals); the reversal of XLD sign indicates these levels are 
inverted with respect to one another. In perovskites, such a change in 
orbital polarization is often attributed to different signs of tetragonal 
distortion (c/a) of the oxygen octahedron™. Analogously, here the 
change in microstructure across 20-25 cycles, namely, the emergence 
of highly oriented films, could allow confinement strain effects to dis- 
tort the oxygen tetrahedron more coherently along the polar axis. This 
synergistic effect could potentially explain the enhanced distortions 
observed in the ultrathin regime. 


Nanospectroscopy. PEEM was performed at the Advanced Light 
Source beamline 11.0.1. X-rays were incident at 30° off grazing, prob- 
ing just the first few nanometres of film, spanning the entire ten-cycle 
(1nm) HZO thickness. Nanospectroscopy point-by-point scans were 
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employed to spatially resolve XLD contrast; at each specified energy 
value in the oxygen K edge (520-550 eV) regime, PEEM images were 
taken for both values of the linear polarization (horizontal, vertical) 
across a 20-ym field of view (1,000 x 1,000 pixel grid). The exposure 
to high-flux synchrotron X-rays probably depolarized the ultrathin fer- 
roelectric sample as photoelectrons were removed from the surface, 
as is observed in ultrathin films of BaTiO, and other ferroelectrics; 
PEEM-XLD images (Extended Data Fig. 9f) illustrate nanoscale do- 
mains at the energy range corresponding to the polar e-split feature. 
Data processing to extract XLD contrast involved dividing images of 
opposite linear polarization, which eliminates topography and work 
function contrast. Topography and work function artefacts contribute 
at the pre-edge (about 530 eV), whereas the intrinsic orbital anisotropy 
contributions manifest only at resonance (about 535 eV); the presence 
of XLD contrast only at resonance confirms the orbital asymmetry ori- 
gins of XLD contrast in ultrathin HZO. Furthermore, the highly textured 
nature of the ultrathin films prevents the XLD contrast from averaging 
to zero (cancellation would be expected for a fully polycrystalline film) 
onalength scale smaller than the experimental resolution. 


Optical spectroscopy 

SHG and inversion asymmetry. Nonlinear optical SHG was performed 
using acustom setup at University of California, Berkeley, as detailed 
ina previous work”. The excitation light was extracted using an optical 
parametric oscillator (Inspire HF 100, Spectra Physics, Santa Clara) 
pumped by a mode-locked Ti:sapphire oscillator. The excitation laser 
was linearly polarized by a900-1,300 nm polarizing beamsplitter. The 
transmitted p-polarized laser light can change its polarization by rotat- 
ing an infrared half waveplate before pumping the sample. The laser 
is focused by a 50x near-infrared objective onto the sample. The SHG 
signal was detected in the backscattering configuration, analysed bya 
visible-range polarizer, and finally collected by a cooled charge-coupled 
device spectrometer. SHG was performed with a 960-nm pump and 
detected at 480 nm under tilt incidence. SHG is commonly used to in- 
vestigate piezoelectric and ferroelectric single crystals and thin films® 
as the photon frequency-doubling process is allowed only in materials 
lacking inversion symmetry. 


Field-dependent SHG. Electric-field-dependent SHG experiments 
were performed on the bare surface of ten-cycle (Inm) HZO films (top 
metal was etched away after phase annealing). The HZO layer was then 
patterned into micrometre-sized islands to enable systematic identi- 
fication of specific HZO regions; various islands were poled with an 
electric field (applied by a PFM tip), while other islands were left as 
is. The optical microscope identified the poled and unpoled islands, 
and the second harmonic signal was detected across various islands. 
Increased SHG intensity, sensitive to out-of-plane polarization in this 
tilt-incidence experimental geometry, in poled HZO islands suggests 
that the electric field increases the projection of out-of-plane polariza- 
tion by aligning domains with different polarization directions. 


Electrical characterization 

Tunnel current measurements. Tunnel current measurements were 
performed using a commercial Semiconductor Device Analyzer 
(Agilent B1500) with a pulse generator unit to enable voltage pulses 
downto the microsecond regime. Samples were patterned into capaci- 
tors of various area, with W as the top electrode, and heavily doped Si 
(10° cm) as the bottom contact. The 19-m W tips (DCP-HTR154-001, 
FormFactor) made electrical contact within a commercial probe station 
(Cascade Microtech). Intandem, conducting atomic force microscopy 
measurements were performed using a commercial scanning probe 
microscope (Asylum MFP-3D) at University of California, Berkeley. 
Current-voltage characteristics through the capacitor device were 
probed inthe AFM by using a Keithley 2400 Source Measure Unit to bias 
the top electrode of the sample through 20-nm-radii Pt/Ir-coated AFM 


probes (25PtIr300B cantilever probe, Rocky Mountain Nanotechnol- 
ogy), grounded to the heavily doped Si substrate. 


Current-voltage hysteresis and tunnel electroresistance. We used 
voltage-polarity-dependent current-voltage hysteresis to rule out resis- 
tive switching mediated by dielectric breakdown and filamentary-type 
switching. For filamentary-mediated resistive switching—often ob- 
served in amorphous HfO,—the sense of hysteresis is dependent on 
the direction of the voltage sweep (that is, the initial polarity of the 
voltage waveform), which dictates the filament formation®. Mean- 
while, ferroelectric tunnel junctions demonstrate the same sense of 
current-voltage hysteresis independent of the sweep direction; this 
voltage-polarity independence is indicative of polarization-mediated 
switching, as observed for our ultrathin ten-cycle (1 nm) HZO films 
(Extended Data Fig. 10e). To further investigate the origin of the re- 
sistive switching, tunnelling electroresistance hysteresis maps as a 
function of write voltage (at low read voltage) demonstrate saturating, 
abrupt hysteretic behaviour (Extended Data Fig. 10b, d) characteristic 
of polarization-driven switching’. Evidence of polarization-driven 
resistive switching from tunnelling electroresistance is provided for 
ten-cycle (Inm) HZO films of two different compositions (Extended 
Data Fig. 10). Although many of the results presented here are for 
films with 4:1 Hf:Zr ratio, for comparison, we have included results 
and demonstrated ferroelectricity for a ten-cycle (1 nm) film with this 
modified 1:1 Hf:Zr ratio. Pioneering work on HZO inthe thicker regime 
(>5 nm)*°*° has shown that a 1:1 Hf:Zr ratio often demonstrates the 
best ferroelectric properties. Ferroelectric tunnel junctions based on 
composite ferroelectric-dielectric barriers using HZO in this thicker 
regime demonstrate promising polarization-driven resistive switch- 
ing results”’”*. Optimizing ferroelectric tunnel junction behaviour 
employing HZO in the ultrathin regime (around 1 nm) will need to be 
carefully studied. 


Data availability 


The datasets generated during and/or analysed during the current study 
are available from the corresponding author on reasonable request. 
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Extended Data Fig. 1| Size effects in fluorite- and perovskite-structure 
ferroelectrics. a, In perovskite ferroelectrics, the polar ‘tetragonal’ distortion 
(c/a) can be represented as the centre cation displacement with respect to its 
surrounding oxygen octahedron. b, In fluorite-structure ferroelectrics, the 
polar ‘rhombic’ distortion (2c/(a + b)) can be represented as the centre anion 
displacement with respect to its surrounding cation tetrahedron; inthe 
nonpolar T-phase, the oxygen atom (blue) lies inthe polyhedral centre of the 
tetrahedron. The evolution of the bulk-stable M-phase to the high-symmetry 
T-phase and polar O-phase in the fluorite-structure structure illustrates the 
role of size effects (surface energies favour higher symmetry) and confinement 
strain (distortions favour lower symmetry) on stabilizing inversion asymmetry. 
Surface energies are critical when considering the role of size effects on 
ferroelectricity; higher-symmetry phases are energetically favourable at 
reduced dimensions owing to lower unit cell volumes. In fluorite structures 
(perovskites), the noncentrosymmetric O-phase (T-phase) has higher (lower) 
symmetry than the bulk-stable centrosymmetric M-phase (C-phase). 
Consequently, surface energies help to counteract depolarization fields in 
fluorite-structure ferroelectrics—already diminished in fluorite structures 
relative to perovskites owing to its lower dielectric constant*—in the ultrathin 
regime. Therefore, both intrinsic (surface energies) and extrinsic (confinement 
strain) mechanisms can favour ultrathin inversion symmetry breaking in 
fluorite structures. Meanwhile, both surface and depolarization energies tend 
to destabilize inversion asymmetry in perovskite ferroelectrics, while epitaxial 
strain can stabilize symmetry-lowering polar distortions®. 
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Extended Data Fig. 2 | Thickness verification of ultrathin HZO films from 
XRR.a, Laboratory diffractometer XRR of HZO thickness series, 
demonstrating clear fringes for thickness extraction present downto 20-cycle 
HZO. b, Synchrotron XRR of ultrathin HZO films, enabling thickness fitting 
analysis for sub-20-cycle films. c, HZO thickness as a function of ALD cycles, as 
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determined by fitting analysis from XRR. The growth rate is about 11 

cycles nm", verified across 10-100 ALD cycle films. Squares (circles) represent 
thicknesses extracted from fitting to synchrotron (laboratory diffractometer) 
XRR measurements. 
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Extended Data Fig. 3 | Thickness verification of ultrathin HZO films using 
TEM.a, HZO thickness asa function of ALD cycles, as determined by Si atomic 
lattice calibration from TEM imaging. The growth rateis ~11 cyclesnm‘™, 
verified across 10-50 ALD cycle films, consistent with XRR (Extended Data 
Fig. 2). The red error bars reflect 20 variation. b, Cross-sectional ADF STEM 
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image of 20 cycles HZO. c-e, Cross-section TEM images of ten-cycle HZO (c), 
15-cycle HZO (d) and 40-cycle HZO (e). f-h, Wide field-of-view TEM images of 
ten-cycle HZO (f),15-cycle HZO (g) and 40-cycle HZO (h) to providea 
perspective of the heterostructure uniformity. The Si substrate is oriented 
along the [110] zone axis for all TEMimages. 
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Extended Data Fig. 4| Emergence of highly-textured films in the ultrathin 
regime. a, Synchrotron GI-XRD scans (A=0.775 A) of HZO thickness series 
endmembers:10-cycle and 100-cycle. The 100-cycle HZO film is indexed 
according to the polar orthorhombic phase Pca2,. Many of the polycrystalline 
reflections, most notably the (111), are no longer present at an appreciable 
intensity in the ultrathin limit owing to the geometric constraints of 
one-dimensional spectra (unable to probe all reflections present in highly 
oriented films) (Methods). Instead tilted-geometry diffraction (pole figures) 
are used to access the oriented reflections. b, Pole figure of ten-cycle HZO, 
taken ataQ, slice corresponding to the film (111) lattice spacing. The radial 
direction represents x, while the azimuthal direction represents @ (0°-360° 
range). The presence of four intense peaks corresponding to the four 
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(111)-projections indicate the highly textured nature of the ultrathin HZO film. 
The four Si (111)-projections would be expected at p =45° off from the Q,, 
principal axes at a smaller value of Q,. c, Schematic of the (311) (left) and (111) 
(right) close-packed planes in the fluorite-structure structure. All the cation 
sites lie on such planes, which minimize surface energy effects because only 
metal-oxygen dangling bonds are present out-of-plane. We note that all 
schematics reflect stacking of the respective planes toa total thickness of lnm, 
although ultrathin HZO films may not exhibit such stacking throughout the 
film. For ten-cycle films, {311} indexing is consistent with the relevant intensity 
(about 30°) observed in the out-of-plane one-dimensional GI-XRD pattern (a), 
and the (111) reflections are present from the two-dimensional pole figure 
pattern (b). 


Article 


a 
960 nm, ~200fs 5 ; 
Ti:Sapphire + OPO fs 
1D motorized stage 
LP filter 1mm Iris 
Attenuator 
NIR PBS 
x 
L z 960 nm pump 
Spectrometer 
480 nm SHG 
DMSP. . 
Sample BP+SP filter 
b c 
960 nm Pump 
—— poled 

—— unpoled 
3 
Poled Unpoled a 
= 
| | : 
2 
7) 
< 
® 
2 
= 
oO 
HZO (1 nm) Q 
7) 

SiO, (2 nm) 


p** Si (substrate) 
420 440 460 480 500 520 540 


Wavelength (nm) 


Extended Data Fig. 5|Inversionsymmetry breaking in ultrathin HZO via 
SHG. a, Schematic of the SHG experimental setup, using a960-nm pump and 
SHG intensity detected around 480 nm under tilt incidence, whichis sensitive 
to out-of-plane polarization (Methods). NIR, near-infrared; 1D, 
one-dimensional; PBS, polarized beam splitter; Obj, objective; LP, BP and SP 
represent long-pass, band-pass and short-pass filters; DMSP, dichroic 
short-pass mirror; M1, M2 and M3 refer to mirrors; OPO, optical parametric 
oscillator. b, Schematic of the ten-cycle HZO islands probed by SHG (Methods); 
micrometre-sized islands enabled identification of specific HZO regions either 
poled with an electric field (applied by a PFM tip) or left as is. For these 
experiments, heavily doped (10” cm”) p-type Si substrates (p** Si) are used to 
serve as the bottom electrode. c, SHG spectrum onaten-cycle HZO film, 
comparing poled versus unpoled SHG intensity. Spontaneous polarization is 
demonstrated by the presence of SHG—allowed only for inversion asymmetric 
systems—in unpoled ten-cycle HZO. This is consistent with PFM phase contrast 
in unpoled HZO regions (Fig. 2c), indicating elimination of the ‘wake-up’ effects 
for ferroelectricity in ultrathin HZO. The enhanced SHG contrast in poled 
films—possibly due to the electric field converting a small fraction of the film to 
the polar phase or aligning polar domains—indicates that the mechanism 
behind the SHG contrast is field-tunable. This field-enhanced SHG is consistent 
with ferroelectric origins and would probably eliminate SHG contrast from 
surface effects. 
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Extended Data Fig. 6| Role of ultrathin confinement for polar phase 
stabilization. a, b, Schematic structure (left) probed by PFM (tip location 
indicated by arrows), topography (centre), and PFM phase contrast images 
(right) onten-cycle HZO ina region that was uncapped (a) versus confined (b) 
by W (represented by ‘M’ for metal in the schematic) during phase annealing. 
Robust 180° phase contrast is only present for the confined HZO. c, Phase (left) 
and amplitude (right) switching spectroscopy loops (V,.=0, ‘OFF’ state) asa 
function of bias voltage on ten-cycle HZO films, demonstrating the critical role 
of confinement during phase annealing in stabilizing ferroelectricity in 
ultrathin HZO. 180° phase contrast and butterfly-shaped amplitude are 
present only for confined HZO. Therefore, both switching-spectroscopy PFM 
and PFM imaging illustrate the critical role of confinement during phase 
annealing for stabilizing the ferroelectric phase. For the PFM images, +7 V was 
applied ina ‘box-in-box’ poling pattern directly on the HZO surface, and 
switching-spectroscopy PFM loops were measured on capacitor structures 
(Methods). d, Schematic structure (left) probed by PFM (tip location indicated 
by arrows) and PFM phase and amplitude hysteresis loops (right) as a function 
of bias voltage on 100-cycle HZO ina region that was confined by W during 
phase annealing. Thicker 100-cycle HZO also demonstrates ferroelectric 
behaviour. 
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Extended Data Fig. 7 | Eliminating artefacts from scanning probe 
microscopy. a, Topography and PFM phase contrast images for ten-cycle HZO 
which did not (left) and did (right) undergo annealing after ALD deposition. 
The terraced topography in the non-annealed film indicates that the weak 
phase contrast is falsely caused by field-induced topographic changes. This is 
consistent with charge injection or ion migration, which plague amorphous 
HfO, films?°. Phase-annealed films do not display such field-induced 
topographic distortions yet demonstrate much clearer phase contrast, 
indicating the origin of PFM phase contrast incrystalline HZO films is different 
than that of amorphous HZO films. In the images shown, +7 V were appliedina 
‘box-in-box’ poling sequence. b, Time-dependent PFM phase contrast images 
onaten-cycle HZO film across a 24-h period. Inthe images shown, +7 V was 
applied in the indicated checkerboard poling pattern. c, Collapse of the PFM 
loop from V,.-series. Schematic capacitor structure probed by PFM (top) and 
piezoresponse asa function of V,,in the ‘OFF’ (V,.=0) state (bottom), 
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demonstrating the collapse of the PFM loop as V,. approaches the coercive 
voltage. This provides further confirmation of the ferroelectric origin of the 
PFM signal as opposed to tip bias-induced mechanisms**. The non-ideal shape 
of the piezoresponse loops, particularly at higher voltages, is probably caused 
by non-ferroelectric contributions from the additional dielectric SiO, layer 
through which most of the voltage is dropped (Methods). d, IDS 
switching-spectroscopy measurements on ten-cycle(1nm) HZO, 
demonstrating hysteresis for the PFM tip on-surface (top) versus no hysteresis 
for the tip off-surface (bottom). The on-surface loops indicate 180° phase 
hysteresis and butterfly-shaped d,;, indicative of ferroelectric behaviour. IDS 
PFM measurements (Methods) remove the long-range electrostatics and 
cantilever resonance artefacts that plague typical voltage-modulated PFM 
switching spectroscopy”*. This ferroelectric origin of the hysteresis is further 
supported by non-hysteretic off-surface loops”®, which probe electrostatic 
contributions. 
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Extended Data Fig. 8| High-frequency capacitance characterization of 
ultrathin HZO. a, Schematic heterostructure of ultrathin HZO on metallic TiN 
probed the microwave capacitance measurements to eliminate contributions 
from the semiconducting Si substrate. b, PFM phase contrast (left) and 
topography (right) imaging for 10 cycles HZO on TiN-buffered Si. Ultrathin 
ferroelectricity persists on top of metallic underlayers as well as dielectric SiO,, 
although the topography is rougher than the films on SiO, due to the 
inhomogeneity introduced by the sputtered TiN. c, SCM dC-dVspectroscopy 
loops taken on multiple bare regions of an ultrathin ten-cycle HZO film, 
demonstrating reproducible SCM response. The square 180° phase hysteresis 
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and dC/dVloops, which integrates into the classic butterfly-shaped 
capacitance-voltage plot (Fig. 2c), provides conclusive evidence of 
ferroelectric polarization switching beyond PFM loops (Fig. 2e, Extended Data 
Fig. 6). The microwave-frequency nature of the SCM enables leakage-mitigated 
differential capacitance measurements of ultrathin films (Methods). d, PFM 
switching-spectroscopy loops taken onthe same region of the ten-cycle HZO as 
the SCM measurements, confirming the ferroelectric-like phase and amplitude 
hysteresis. We note that the SCM and PFM switching spectroscopy was done 
using the Asylum Cypher scanning probe microscope at Asylum Research 
(Methods). 
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Extended Data Fig. 9 | Ultrathin-enhanced distortions and polar signatures 
from spectroscopy.a, Crystal field splitting diagram for the fluorite-structure 
structural polymorphs; symmetry-induced e-splitting providesa 
spectroscopic signature for the polar O-phase (Methods). b, Delineating 
symmetry-split energy regimes in oxygen K-edge XAS. Just as convergent beam 
electron diffraction provides signatures to demonstrate inversion symmetry 
breaking*’, XAS provides spectroscopic signatures to distinguish between the 
nonpolar tetragonal and polar orthorhombic polymorphs (difficult to resolve 
from GI-XRD). Left, simulated XAS spectrum for tetragonal ZrO, (P4,/nmc) and 
right, polar orthorhombic ZrO, (Pca2,), both courtesy of the Materials 
Project®’*”. The background colour shading denotes the symmetry-split 
regimes explained in the crystal field splitting diagram.c, Experimental XAS 
data on ultrathin HZO displays similar spectroscopic XAS features as the 
simulated polar O-phase (Pca2,)—namely, relative e/t, spectral weight and 
splittings corresponding to tetrahedral (A,) and rhombic (Ag) distortions. Left, 
XAS of the HZO thickness series at the O K-edge, zooming in on the e- and 
t,-regimes. Right, O K-edge spectral weight trends asa function of HZO 
thickness. The relative spectral weights from the ¢,/e and e-split regimes 
indicate enhanced tetrahedral (A,) and rhombic distortions (A,) in ultrathin 
films, respectively, consistent with C,, symmetry of the polar O-phase. 

d, Schematic representation of the cation nearest-neighbour coordination 
dropping from NN =8 (T-phase) to NN =7 (polar O-phase) as the crystal 
symmetry is lowered. The disorder in oxygen polyhedral coordination (note 


the different oxygen atoms denoted by the blue and cyan atoms in the polar 
O-phase) manifests as spectral weight in the pre-edge regime”. e, The 
experimental pre-edge spectral weight as a function of thickness, indicating 
ultrathin-enhanced polyhedral disorder. f, Top: PEEM-XLD images of ten-cycle 
(1nm) HZO at the O K-edge. Pre-edge images (left) exhibit no XLD contrast, 
while on-edge images (right)—at the energy corresponding to the 
polar-distortion split e-regime—demonstrate XLD contrast. This suggests that 
XLD is indeed sensitive to polar features in ultrathin highly textured HZO. 
Bottom, line profile of the XLD intensity, demonstrating substantial variations 
in on-edge XLD data compared to noise for pre-edge XLD. g, Crystal field 
splitting energies in HZO-related transition metal oxide systems. The material 
system, primary crystal electric field (A,), secondary crystal electric field (A,), 
and structure for various systems related to HZO and perovskite ferroelectrics 
are shown, where Ao, A,, A;and A, corresponds to octahedral, tetragonal, 
tetrahedral, and rhombic crystal electric field (CEF), respectively. The 
reference crystal electric field values are taken from the Materials Project 
database* (reference codes denoted by ‘mp’), and the experimental values are 
extracted via XAS energy-split features (b). The large tetrahedral (A,;) and 
rhombic (A,) crystal field splitting energies present in ten-cycle HZO films are 
much larger than expected values for the polar fluorite-structure ZrO, (b), 
which highlights the enhanced distortion present in ultrathin films subject to 
confinement strain, and is consistent with anomalously large structural 
distortions extracted from diffraction (Fig. 3g). 
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Extended Data Fig. 10| Ultrathin HZO ferroelectric tunneljunction.a,c, 
Tunnel current-voltage characterization of Si(p™*)/SiO,(1nm)/HZO(~1nm)/W 
capacitor devices—demonstrated for ten-cycle HZO with Hf:Zr composition 4:1 
(a) and 1:1 (c)—as a function of the write pulse (to set the ferroelectric 
polarization state). Tunnelling electroresistance behaviour is demonstrated 
for +2 Vwriteand100 mV read. Insets, linear-scale current-voltage 
characteristics of the two polarization-driven current states. b, d, Tunnelling 
electroresistance hysteresis map as a function of write voltage (demonstrated 
for ten-cycle HZO with Hf:Zr composition 4:1 (b) and 1:1 (d)) measured at 200 
mV read voltage. The abrupt hysteretic behaviour and saturating tunnelling 
electroresistance is characteristic of polarization-driven switching”, as 
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opposed to filamentary-based switching caused by electrochemical migration 
and/or oxygen vacancy motion (Methods). e, Current-voltage hysteresis 
sweeps ruling out non-polarization-driven resistive switching mechanisms 
(Methods). The device demonstrates current-voltage hysteresis at low voltage 
and voltage polarity-independent current-voltage hysteresis sense: both 
negative-positive-negative voltage polarity (left) and positive-negative- 
positive voltage polarity (right) demonstrate counter-clockwise hysteresis. 
Such behaviour rules out resistive switching mediated by dielectric breakdown 
and filamentary mechanisms“ and is consistent with polarization-driven 
switching. 
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After 50 years of development, the technology of today’s electronics is approaching 
its physical limits, with feature sizes smaller than 10 nanometres. It is also becoming 


clear that the ever-increasing power consumption of information and communication 
systems’ needs to be contained. These two factors require the introduction of 
non-traditional materials and state variables. As recently highlighted’, the remanence 
associated with collective switching in ferroic systems is an appealing way to reduce 
power consumption. A promising approach is spintronics, which relies on 
ferromagnets to provide non-volatility and to generate and detect spin currents’. 
However, magnetization reversal by spin transfer torques’ is a power-consuming 
process. This is driving research on multiferroics to achieve low-power electric-field 
control of magnetization’, but practical materials are scarce and magnetoelectric 
switching remains difficult to control. Here we demonstrate an alternative strategy to 
achieve low-power spin detection, ina non-magnetic system. We harness the 
electric-field-induced ferroelectric-like state of strontium titanate (SrTiO,)*° to 
manipulate the spin-orbit properties” of a two-dimensional electron gas", and 
efficiently convert spin currents into positive or negative charge currents, depending 
onthe polarization direction. This non-volatile effect opens the way to the 
electric-field control of spin currents and to ultralow-power spintronics, in which 
non-volatility would be provided by ferroelectricity rather than by ferromagnetism. 


‘Spin-orbitronics™ exploits the interplay between charge currents and 
spin currents enabled by spin-orbit coupling (SOC) in non-magnetic 
systems. It allows the generation of pure spin currents from charge 
currents and vice versa, without resorting to ferromagnetic materials. 
The Edelstein effect” allows charge-spin conversion” with an efficiency 
comparable to or larger than that of the spin Hall effect”. It typically 
occurs at Rashba surfaces and interfaces” where inversion symmetry 
breaking results in an out-of-plane electric field. Inthe presence of SOC, 
this leads to alocking of the momentum and spin degrees of freedom. 
The flow of an in-plane charge current in such a system produces a 
transverse spin density, which can diffuse as a spin current in an adja- 
cent material”. Conversely, injecting a spin density results in the pro- 
duction ofanet charge current by the inverse Edelstein effect’. As such, 
Rashba systems can be used as spin generators and detectors. However, 
the conversion rate is inherently set by the electronic structure, and 
cannot be switched by an external stimulus. 

The order parameter of ferroelectrics (polarization) can be switched 
by anelectric field for energy costs typically 1,000 times smaller? than 
those for switching ferromagnets. Moreover, ferroelectrics can harbour 
intense electric fields, substantially modifying the carrier densities in 
adjacent materials, and thereby tuning their properties inanon-volatile 
fashion. An exciting route towards low-power electronics would thus be 


to combine the remanence of ferroelectrics with the ability to generate 
and manipulate spin currents by the direct and inverse Edelstein effects 
in Rashba systems. Beyond magnetoelectricity, ferroelectric Rashba 
architectures would therefore offer anew approach to the non-volatile 
control of spin currents by electric fields, with ultralow-power opera- 
tion. 

Most efforts to identify single-phase Rashba ferroelectrics’ have 
focused on GeTe (ref. ”). However, ferroelectric properties are poor” 
because of high leakage, and spin-charge conversion experiments 
have yielded a moderate efficiency”. Here we show that beyond bulk 
materials, interface systems combining Rashba SOC and a switchable 
polarization enable the non-volatile electric control of a highly efficient 
spin-charge conversion. 

The general concept of ferroelectricity-controlled spin-charge con- 
version is described in Fig. 1. At the interface between a ferroelectric 
and anultrathin SOC system—suchasa heavy metal, a Weyl semi-metal, 
oratwo-dimensional electron gas (2DEG)—electrons are accumulated 
or depleted depending on the polarization direction (Fig. 1a). This 
modifies the electric field in the interface region, and in the ideal 
case changes its sign. If a Rashba state is present in the SOC system at 
the interface with the ferroelectric, reversing the sign of the local 
electric field reverses the chirality of the spin textures in both split 
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Fig. 1| Concept of ferroelectricity-controlled spin-charge conversion. 

a, Sketch ofa ferroelectric Rashba architecture combining a ferroelectric 
material (green) and a material with spin-orbit coupling (SOC; purple). Upon 
switching polarization, electrons are accumulated (left) or depleted (right) in 
the SOC material (for example, a 2DEG), creating an electric field whose sign 
depends onthe polarization direction. b, Corresponding Rashba-split chiral 
Fermi contours with spin-momentum locking. The chirality of the contours 
switches upon switching the ferroelectric polarization. k,and k, correspond to 
thexand yaxes in momentum space. Blue and red colours indicate the spin 
direction (blue, along k,; red, along -k,). c, Inverse Edelstein effect ina Rashba 
interface. Whenaspincurrent is injected (for example, by spin pumping) witha 
spin polarization along the yaxis, the spin population is altered, causing a 
displacement of the two inequivalent Fermi surfaces (red and blue lines) by +Ak 
in momentum space. This results ina net charge current generated 
perpendicularly to the spin current and to its spin polarization. The sign of the 
generated current depends onthe chirality of the Fermi contours and is thus 
reversed upon switching ferroelectric polarization. 5s, corresponds to the 
injected excess of spin-up density. d, Non-volatile device operated by 
ferroelectricity and Rashba SOC. Through the inverse Edelstein effect a charge 
current/,is generated by the conversion of aspin current/, injected from the 
ferromagnet. The sign of /, changes with the direction of the ferroelectric 
polarization. The large white arrows show the ferromagnet magnetization, the 
small black arrows the spin o, the brown arrows the direction of the charge 
current/,, and the red arrow the direction of the spin current/,. The small 
black-and-white arrows correspond to the ferroelectric polarization. 


Fermi contours (Fig. 1b). Through the inverse Edelstein effect”, the 
injection of a spin current into the Rashba state will produce a charge 
current/, whose sign will depend on the ferroelectric polarization state 
(Fig. 1c). This mechanism offers the possibility to design a wealth of 
devices suchas the bipolar memory proposed in Fig. 1. It canalso be the 
basis of logic devices” akin to the magnetoelectric spin-orbit (MESO) 
device proposed by Intel”, but without resorting to a multiferroic to 
switch the ferromagnet. 

To experimentally demonstrate the non-volatile electric control of 
the spin-charge conversion, we use SrTiO, (STO) 2DEGs, generated 
by the deposition of a film of Al onto a STO single crystal**”’. Indeed, 
STO 2DEGs exhibit a sizeable Rashba SOC” with a very high conver- 
sion efficiency”>”°. In addition, STO is a quantum paraelectric that 
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develops anelectric-field-induced switchable polarization at low tem- 
perature’ ”. 

The spin-to-charge conversion was measured by using spin pump- 
ing by ferromagnetic resonance on a NiFe(20 nm)/Al(0.9 nm)//STO 
sample (see Fig. 2a inset). The nominally 500-ym-thick STO substrate 
was thinned down to 250 + 20 um using mechanical polishing, allow- 
ing the application of high electric fields (£). A static magnetic field 
was applied along the y direction. At the ferromagnetic resonance, a 
pure spin currentis injected into the 2DEG along the -z direction, with 
spins oriented along y (ref. 7”). The measurement of the extra damp- 
ing due to this relaxation channel allows calculation of the injected 
spin current”*”’”, In the 2DEG, this spin current is then converted intoa 
charge current oriented along x by the inverse Edelstein effect. Since 
the sample is in open circuit, at the resonance field this results ina 
voltage drop along the sample, in the x direction”®. 

Inthe pristine, ungated state, the voltage drop obtained at resonance 
corresponds to the production of a positive normalized current of 
1.2A mT? m1 (top left panel of Fig. 2b). At low temperature, STO is 
known to undergo a phase transition at high electric field” °: once a 
large electric field has been applied, the material develops a switchable, 
remanent polarization. This phenomenon is often referred to as a 
field-induced ferroelectric order or a field-induced ferroelectric-like 
state. We applied voltages up to +200 V, corresponding to F up to 
+8kV.cm, high enough to achieve this phase transition”. After a first 
initialization cycle [+200 V;-200 V; +200 V], the gate-voltage depend- 
ence of the spin-pumping signal shows a hysteretic behaviour (Fig. 2a). 
The charge currents produced at ferromagnetic resonance have oppo- 
site signs for +200 V and —200 V gate voltages, as seen in points B, 
Fand D of Fig. 2a and b. After applying the maximum voltage, the nor- 
malized current reaches a very high amplitude (+8.8 AmT *m"), beyond 
the record values obtained previously in LaAlO;/STO and Al/STO sam- 
ples (around5 AmT?m_")*. The spin-charge conversion efficiency is 
quantified by the inverse Edelstein length A,,,, equal to the ratio of the 
2D charge current density produced by the injected 3D spin current, 
thatis, Age =J/2” //2(ref.”; see Methods). Here we estimate A,-,~+60 nm, 
a value one to two orders of magnitude larger than in metallic Rashba 
interfaces” or topological insulators”. 

We note that the produced current—and thus the spin-charge con- 
version rate—is remanent at gate voltage V,,.. = 0 V, as seen in C andE 
in Fig. 2a and b. Similar hysteretic behaviours have been obtained on 
several thinned-down samples but not on a 500-um-thick STO sub- 
strate, which indicates the existence of a critical electric field for the 
hysteresis to appear. The non-volatile control of the spin-charge con- 
version is further evidenced by Fig. 2c, which displays the normalized 
charge current produced at O V after the application of 500-ms pulses 
of +200 V gate voltage. Figure 2d shows the temperature dependence 
of the difference A/. in the produced current obtained at remanence 
after applying pulses of +200 Vand -200 Vat 7K. A/. is large below 30K 
and vanishes above 45-50 K, suggesting a transition of STO into the 
paraelectric phase’ °. Extended Data Figs. land 4 show that the effect 
is reproducible and stable in time for at least several hours. 

We have also performed electric polarization measurements ona 
Al(1.8 nm)//STO 2DEG sample with a STO thickness of 200 + 20 tm. As 
visible in Fig. 3a, the application of an electric field up to 2.5kV cm™ 
(green curve) yields alinear dependence of the polarization with F, as 
expected for a dielectric. However, when the voltage exceeds about 
7 kV cm, a hysteresis develops, associated with switching current 
peaks in the/ versus F data (Fig. 3a inset). The saturation polarization 
is about 4 1C cm”, in agreement with earlier reports’. Upon increas- 
ing the temperature (Fig. 3c), the loop progressively closes, indicat- 
ing a Curie temperature close to 50 K (Fig. 3b). This almost coincides 
with the temperature at which the remanent spin-charge conversion 
effect vanishes (Fig. 2d), strongly suggesting that the switchable 
polarization is at the origin of the hysteretic inverse Edelstein effect. 
At low temperature, reducing F to below the critical value still yields 
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Fig. 2 | Electric-field-controlled spin-charge conversion with electrical 
remanence. a, Gate-voltage dependence of the normalized current produced 
by the inverse Edelstein effect. A-F, /, conditions examined in b. Inset, asketch 
of the heterostructure. b, Magnetic-field dependence of the normalized 
current produced in spin-pumping experiments, for different voltage values 


hysteretic polarization loops, albeit with a lower remanent polariza- 
tion (Fig. 4a). 

One of the hallmark features of STO 2DEGs is the strong gate-voltage 
dependence of the sheet resistance”? R,. In thick STO samples the 
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Fig. 3 | Electric polarization measurements. a, Polarization versus electric 
field curves measured ona Al(1.8nm)//STO sample. The green curve 
corresponds tothe polarization loop measured with a maximum field of 


Temperature (K) 


(seea).c, Normalized charge current produced at electrical remanence after 
applying positive or negative voltage pulses of +200 V. All data have been 
measured at 7 K.d, Temperature dependence of the difference between the 
remanent normalized currents after the application of alarge positive or 
negative voltage. 


gate dependence of R, is usually non-hysteretic”, in line with the par- 
aelectric nature of STO at low electric fields. Here, as seen in Fig. 4b, 
R,; varies as the carrier density varies, but this dependence also 
exhibits a clear hysteresis, allowing the non-volatile electric control 


a 
£ 
[s) 
1S) 
S 
5 -12 
S 
oO 
iN 
y 
5 
[e) 
jae 


LB 


Electric field (kV cm) 


2.5kV cm‘ Inset, corresponding current versus electric field curve. 
b, Temperature dependence of the remanent polarization P,.c, Polarization 
loops at different temperatures. 


Nature | Vol580 | 23 April2020 | 485 


Article 


a [T T eT T T T T ‘5 b 304 T T | aaa | T T a aa i 
144 4 
12+ Z 25+ 
© 10 | 
5 20+ | 
Za} i 
re 
el. | = 15 : 
N a 
is] 
GS yl ] 
© 10+ 4 
Qe 4 
5 + 
OF =f 


A? Wott Obert 


8-6-4202 4 6 8 -8-6-4-20 2 46 8 
Electric field (kV cm) Electric field (kV cm“) 


Fig. 4| Field effect experiments. a, Polarization loops at 2 K measured inthe 
field-induced state for different increasing maximum electric fields. The 
curves are shifted by 2 ;Ccm” for clarity. b, Gate dependence of the 2DEG sheet 
resistance R, for different maximum electric fields at 2K. The curves are shifted 
by 3kQ for clarity. 


of the 2DEG electronic properties. We note that the hysteresis ampli- 
tude increases upon increasing the maximum £, so that the R, versus 
E loops mimic the polarization loops of Fig. 4a. Hall measurements 
made in the two remanent states yield a difference in carrier densi- 
ties An, = 5.45 x 10” cm”, only two times smaller than the theoretical 
value An, = 2P,/e = 1.13 x 10° cm™ (using the remanent polarization 
P.=0.9 uC cm”); this corresponds to a notable efficiency compared 
to the literature”. Note that we have also performed R; versus E 
loops on spin-pumping samples, which possess a NiFe layer, showing 
that the obtained loops are very similar to the Ag versus Va. loops 
(see Extended Data Fig. 1and Methods). 

Several mechanisms may be invoked to explain our observation ofa 
hysteretic inverse Edelstein effect. One can be related to the descrip- 
tion of Fig. la, namely, alocal inversion of the electric field inthe SOC 
material (here the 2DEG) promoting polarization-direction-dependent 
Rashba SOC and spin-charge conversion. Additionally, electronic 
structure effects may be at play, since the multiorbital band struc- 
ture of STO 2DEGs is known to produce effective Rashba effects with 
opposite signs, depending on the orbitals involved*>”°. Moreover, 
the presence of a switchable polarization with associated polar dis- 
placements of cations and anions should substantially modify the 
band structure compared to the paraelectric case. This may generate 
additional (avoided) band crossings, possibly with non-trivial topol- 
ogy”, leading to super-efficient spin-charge conversion. 

Our results constitute the basis of a new type of spintronics, in 
which non-volatility would not originate from ferromagnetism but 
from ferroelectricity. They could be extended to room temperature by, 
for example, designing 2DEGs on strained STO thin films® or BaTiO, 
(ref. 7*). This could open the way to a new class of ultralow-power spin- 
orbitronic devices (such as memories, spin field-effect transistors, 
spin Hall transistors or MESO-like logic devices). In the future, demon- 
stration ofa non-volatile electric control of the direct Edelstein effect 
could additionally lead to reconfigurable spin-orbit torque memo- 
ries and logic gates, be of benefit to the manipulation of skyrmions or 
domain walls, and allow the development of agile terahertz emitters 
and spin-wave logic architectures. 
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Methods 


Sample preparation 

NiFe and Al films were deposited at room temperature by d.c. mag- 
netron sputtering on TiO,-terminated (001)-oriented STO substrates 
(from CrysTec). TiO,-termination was achieved through a chemi- 
cal treatment, where the substrate was submerged in a buffered 
hydrofluoric acid (NH,F-HF 7:1) for 30 s and annealed under an 
oxygen-rich environment at 1,000 °C for 3 h. Before deposition, the 
STO substrates were additionally annealed at 730 °C for 2h under a 
partial oxygen pressure of 400 mbar. The deposition of the metallic 
layers was performed under an Ar partial pressure of 4.5 x 10 * mbar and 
asubstrate-to-target distance of 7cm. The samples including NiFe were 
additionally capped with a 2.5-nm layer of Al, which becomes oxidized 
when exposed to air. Samples were mechanically polished on diamond 
pads under deionized water flow. 


Spin pumping 

The spin-pumping experiments were carried out using a Bruker 
ESP300E X-band CW spectrometer at 9.68 GHz, with a loop-gap 
Bruker ER 4118X-MSS cavity, and using a microwave power of 5 mW 
or less to remain in the linear regime. The generated d.c. voltage was 
measured using a Keithley 2182A nanovoltmeter. The gate voltage was 
applied using a Keithley 2400 sourcemeter. The measured signals were 
observed to be linear with the r.f. power up to 5 mW. 


Calculation of the inverse Edelstein length 

The inverse Edelstein length /,,; is the figure of merit quantifying the 
efficiency of the spin to charge current conversion. It has the dimension 
of alength, as the 3D spin current iP (in Am”) is converted into a 2D 
charge current Je (inAm”): 


Ace = 73D (1) 


Both ye and der need to be evaluated to calculate the inverse 
Edelstein length. Here we use the method already described in previous 
works (for example, on LAO/STO (ref. 7°) or HgTe (ref. 78). 

The produced charge current is simply extracted from the symmet- 
ric component of the measured spin signal V,,,,: 


V, 
te ai (2) 


where Ris the resistance of the sample (measured independently), and 
wis the sample width (400 pm). 

Note that here, the produced current Joe is used to give the ampli- 
tude of the spin signal, as it can be considered as raw data. In order 
to have values comparable from measurement to measurement, 
especially with experiments found in the literature, and as the 
spin signal varies linearly with the square of the excitation field 
Hoh (hy is the radio-frequency field), the current production 
has to be normalized. Thus, the produced current given inthe main 
text is actually J2° /(up/y,)”, in A mT m". The radiofrequency field 
for a given measurement is measured using the cavity conversion 
factor. 

The spin current is extracted using the spin-pumping theory first 
developed by Tserkovnyak, Brataas””* and co-workers, and then by 
several other groups***. The spin current injected at the ferromag- 
netic resonance can be obtained by measuring some of the magnetic 
properties of the ferromagnetic layer, and by calculating the effective 
spin mixing conductance: 


a ref) (3) 


where pl, is the Bohr magneton, ¢,,, the thickness of the ferromagnetic 
material (20 nm here), M, the saturation magnetization of the Permal- 
loy thin film, g its g-factor, a its Gilbert damping, and a,,;the Gilbert 
damping of a Permalloy thin film without a spin-sink (here Permalloy on 
native Si). All these values are extracted from independent ferromag- 
netic resonance (FMR) measurements, using either broadband-FMR 
or out-of-plane angular dependence measurements. 

Then, using the expression of the spin mixing conductance we can 
obtain the injected spin current: 


po = Gixy7hp hie| 4M + | (4M,y)? + 4a? |2e 


s 81a? (4mM.y)? + 407 h 


(4) 


where y is the gyromagnetic ratio, w the angular frequency, e the 
elementary charge and f the reduced Planck constant. The inverse 
Edelstein length can then be obtained by combining equations (1), 
(2) and (4). 


Gate-voltage dependence reproducibility 

We have performed spin-pumping measurements on different samples 
of NiFe(20 nm)/AI(0.9 nm)//STO at 7 K, see Extended Data Fig. 1. Sample 
lis taken from a first batch, whereas samples 2 and 3 are two different 
samples from the same second batch. The results shown in the main 
text have been measured on sample 3. After thin film deposition onSTO 
substrates, the samples were all thinned down to the same thickness 
(250 + 20 pm). As can be seen in Extended Data Fig. 1, for these three 
samples similar gate-voltage dependences have been obtained, with 
a hysteretic behaviour, a positive or negative remanent spin-signal at 
Veate= 0 V, and large conversion efficiencies. The obtained inverse Edel- 
stein lengths /,,; are above 40 nm inall three cases, and up to60 nmin 
the case of sample 3. The error bars are mostly due to the uncertainty 
onthe effective spin mixing conductance. The main results presented 
in the text are thus reproducible, even though the samples have been 
thinned down using mechanical polishing. 

We have also performed several cool-downs on the same sample. 
After performing a first cool-down and some gate dependence meas- 
urements at lowtemperature, it is possible to recover the initial state by 
heating up the sample at room temperature. As can be seen in Extended 
Data Fig. 2 (measured on sample 2), the remanent ferroelectric state 
is lost after heating, but when going back to 7 K the sample recovers 
the initial state, with a lower and positive spin signal. This is consist- 
ent with our observation of a voltage-induced ferroelectricity at low 
temperature. After heating at room temperature, an initialization loop 
[+200 V; -200 V; +200 V] performed at lowtemperature allows retrieval 
of the hysteretic behaviour and the remanence of the polarization. 


Time stability of the remanent state 

In the main text we show that a +200 V gate-voltage application at 7 K 
allows the spin-charge conversion to be controlled ina remanent way. 
To demonstrate the non-volatility associated with this remanence, we 
performed spin-pumping measurements hours after applying a gate 
voltage of either +200 V or -200 V for 500 ms. As seen in Extended 
Data Fig. 3, the produced normalized current is preserved, remaining 
unmodified after several hours. 


Electric polarization measurements 

Inthese experiments, atriangular waveform was applied at a frequency 
of 1kHz across the STO, between the 2DEG and a bottom electrode of 
Ti/Au, and the current/was measured in real time. Integrating the current 
over time and normalizing by the sample area yields the polarization. 


Magnetotransport 
Low-temperature electrical transport measurements were performed 
on the thinned samples bonded by AI wires in the van der Pauw 
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configuration using a standard a.c. lock-in technique (/,,= 200 nA, 
fac = 77.03 Hz) ina Quantum Design Dynacool cryostat at a tempera- 
ture of 2 K and magnetic fields between —9 T and 9 T for the Hall 
resistance study. Before any back-gate voltage data were recorded, 
the samples were subjected to a so-called forming step” at 2 K, 
where the back-gate voltage were cycled several times (>2) between 
the gate-voltage extremes of the particular gate-voltage interval to 
ensure no irreversible changes would occur in the interface system 
upon application of the back-gate voltage in the actual experiment. 
Note that this low-temperature forming step was repeated following 
all occasions the sample was brought above 105 K. Moreover, at each 
new cool-down, the samples were always cooled with the back-gate 
electrostatically grounded. 


R-Vloops measured on NiFe/AI/SrTiO, samples 

Extended Data Figure 3 shows R-V loops measured on the NiFe/Al/ 
STO sample used for spin pumping. The R-V and /.-V loops have 
rather similar shapes, indicating a similar origin for both hysteresis 
effects. The observed two-probe resistance variation of ~0.27 Qin this 
0.4mm x 2.4mm NiFe(20 nm)/AIO,//STO sample is compatible with the 
R-Vfor anAlO,//STO sample shown in Fig. 4b. The room-temperature 
sheet resistance of the NiFe(20 nm)/AIO,//STO sample is roughly that 
of the NiFe layer, and equal to 9 Q. In Fig. 4, gating results in a change 
of the 2DEG sheet resistance from about 1.7 kQ to 23.5 kQ. Ina simple 
parallel model of the NiFe(20 nm)/AIO,//STO sample (in which current 
flows in parallel inthe NiFe and the 2DEG), gating should thus result in 
a sheet resistance change of 


RepEGpNike ReDEGpNike 


AR= RBDEG + piiFe re, RBDEG + pNiFe Ms. 
23,500x9 1,700x9 
23,500+9 1,700+9 
= 0.044 0 


corresponding to an expected two-probe resistance change of 0.260, 
in excellent agreement with the observed change of 0.27 QO. VG- and 
VG+ indicate negative and positive gate voltages, respectively. 

The shape of the P-Vand R-Vloops of Fig. 4 is different from that 
of the /.-Vand R-V data of Extended Data Fig. 3. One reason is that 
the spin-pumping experiments were performed ona Al//STO sample 


covered with a NiFe layer to perform the spin injection, whereas the 
resistance versus electric field (R-E) and polarization versus electric 
field (P-E) loops were performed on Al//STO samples without NiFe and 
thus witha different electrostatic geometry. Additionally, the sample 
dimensions are also different for the two sets of experiments. In the 
spin pumping FMR experiments, the STO thickness is 250 pm, and the 
lateral size is 0.4 mm x 2.4 mm. For the R-F and P-F loop experiments, 
the STO thickness is 200 pm and the lateral size is 5 mm x 5 mm. Finally, 
the SP-FMR samples are cut from plain samples, which could induce 
defects modifying the coercivity. We believe the observed discrep- 
ancy between loops to arise primarily from these above-mentioned 
differences. 


Data availability 


The data that support the findings of this study are available from the 
corresponding authors upon reasonable request. 
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Extended Data Fig. 1| Gate-voltage dependence of the inverse Edelstein effective spin mixing conductance Geis ranging from1.2nm7to3.2nm? with 
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Extended Data Fig. 2 | Spin-pumping signals obtained at 7 K on sample 2, for three different cool-downs from room temperature. After each cool-down, the 
signal was measured before any gate-voltage application. 
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Extended Data Fig. 3 | Spin-pumping and resistance loops ofa NiFe/Al/STO sample. Black data points, two-probe resistance R of a NiFe/Al/STO sample, 
measured in the spin-pumping setup as a function of the back-gate voltage. Red data points, normalized charge current production (/,) measured by spin pumping. 
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Extended Data Fig. 4| Dependence of the produced current on the time spent after application of a positive or negative gate voltage. Black squares, +200 V; 
red circles, -200 V. The measurements were performed at 7 Konsample1. 
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From rock salt to nanoparticle superlattices, complex structure can emerge from 
simple building blocks that attract each other through Coulombic forces’ *. On the 
micrometre scale, however, colloids in water defy the intuitively simple idea of 
forming crystals from oppositely charged partners, instead forming non-equilibrium 
structures such as clusters and gels’. Although various systems have been 
engineered to grow binary crystals®™, native surface charge in aqueous conditions 
has not been used to assemble crystalline materials. Here we form ionic colloidal 
crystals in water through an approach that we refer to as polymer-attenuated 
Coulombic self-assembly. The key to crystallization is the use of a neutral polymer to 
keep particles separated by well defined distances, allowing us to tune the attractive 
overlap of electrical double layers, directing particles to disperse, crystallize or 
become permanently fixed on demand. The nucleation and growth of macroscopic 
single crystals is demonstrated by using the Debye screening length to fine-tune 
assembly. Using a variety of colloidal particles and commercial polymers, ionic 
colloidal crystals isostructural to caesium chloride, sodium chloride, aluminium 
diboride and K,C,, are selected according to particle size ratios. Once fixed by simply 
diluting out solution salts, crystals are pulled out of the water for further 
manipulation, demonstrating an accurate translation from solution-phase assembly 
to dried solid structures. In contrast to other assembly approaches, in which particles 
must be carefully engineered to encode binding information” “, polymer-attenuated 
Coulombic self-assembly enables conventional colloids to be used as model colloidal 


ions, primed for crystallization. 


To assemble ionic colloidal crystals in water, we employ a twist to Der- 
jaguin-Landau-Verwey-Overbeek (DLVO) theory” that balances the 
electrostatic attractive force between oppositely charged particles 
with steric repulsion from well-defined polymer brushes between them. 
Attractionis provided by overlapping electrical double layers, that is, 
clouds of oppositely charged ions surrounding charged particles. The 
thickness of the double layer is characterized by the Debye screening 
length A,, which sets the range of attraction. The polymer brush serves 
as a particle spacer and its purpose is twofold: first to prevent particles 
from entering the van der Waals region, and second to set the overlap 
of the electrical double layers. Particle separation regulates the ampli- 
tude of the electrostatic attraction, effectively forming anionic bond 
whose strength can be tuned through ,. This ideais illustrated in Fig. 1a, 
where we consider the contribution of the electrostatic potential V, 
(dotted) between overlapping oppositely charged double layers and 
the repulsive potential V, (dashed) between polymer brushes. As we 
typically consider particle spacers of thickness larger than 6 nm, 
particles do not come in close contact and we can neglect van der 
Waals interactions. V, is obtained as for DLVO theory, but consider- 
ing particles of opposite surface potentials YW, and ¥. This 
gives: we = 2nery,p exp(—h/Ap), where ris the radius, his the surface- 
to-surface particle separation, k, is the Boltzmann constant, Tis tem- 
perature and cis the solvent permittivity”. The repulsion between the 
polymer brushes, V,, is given by the Alexander-de Gennes polymer 
brush model (Methods), built on scaling arguments and predicting 


forces similar to more advanced theoretical treatments”. Superposition 
of the two yields the pair potential V, between oppositely charged 
particles (solid lines), which exhibit alocal minimum conveniently tuned 
by varying A, relative to the thickness of the polymer spacer L. The depth 
of the minima corresponds to the bond energy £, between oppositely 
charged particles. For A, <<L, the polymer brush prevents the double 
layers from overlapping, the electrostatic interaction vanishes and the 
particles are sterically stabilized. For A, >>L, the presence of the polymer 
brush is negligible, the double layers fully overlap, and the particles 
aggregate. For A, comparable to L, repulsion from the brush and elec- 
trostatic attraction nicely balance to establish an ionic bond F£, of afew 
k,T. Here the particle spacers maintain separation between oppositely 
charged particles while attracting one another electrostatically, provid- 
ing cohesion with reconfigurablility—an indispensable condition for 
dependable assembly. Qualitatively, the equilibrium distances prove 
to be close to the contact point between the two brushes, h = 2L, as for 
anearly impenetrable wall, which provides an intuitive handle to both 
rationalize the interaction and streamline the experimentation. 
Following this simple principle, the assembly of an ionic colloi- 
dal crystal requires only three ingredients: (1) oppositely charged 
particles, (2) anon-ionic surfactant that forms a uniform brush onto 
the surface of the particles and (3) salt to set the range A, of the elec- 
trostatic interactions. Ina prototypical self-assembly experiment, 
a block copolymer (Pluronic F108) adsorbs onto a set of oppositely 
charged polystyrene (PS) microspheres (r=500 nm), forming abrush. 
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The colloids are then equilibrated in sodium chloride (NaCl) solu- 
tions and simply mixed together. After mixing, the fate of the binary 
suspension is set by the salt concentration in the system. We observe 
three distinct assembly behaviours that are consistent with the relative 
bond energies calculated using our model. At low salt concentrations 
(<4.3 mM), F, is much larger than the thermal energy k,7, causing the 
particles to bind irreversibly. What follows is a catastrophic floccula- 
tion that yields the macroscopic heteroaggregates shown in Fig. 1b. 
Increasing the salt concentration (>4.9 mM) reduces F,, leading toa 
less intuitive scenario whereby oppositely charged particles coexist 
inastable suspension (Fig. 1c). In between these two regimes, we find 


Fig. 1|Polymer-attenuated Coulombic self-assembly. a, Oppositely charged 


potential (V,») comprises polymer repulsion (V,) and Coulombic attraction (V;). 


value of Ap, the corresponding V,, minima represent the ionic bond strength&,. 
b-d, Fromleft to right: bright field microscopy, confocal microscopy and 


computer simulations of oppositely charged F108-grafted PS spheres at 
Ap = 6.0 (b), 3.0 (c) and 4.5 (d). Scale bars, 4 um. 


a narrow window of salinities in which oppositely charged particles 
behave as model ions, self-assembling into perfectly ordered ionic 
solids (Fig. 1d). The emergence of vibrant structural colours within a 
sample (Extended Data Fig. 1, Supplementary Video 1) easily reveals 
this ‘Goldilocks zone’, which—assuming a brush length Z = 10 nm for 
Pluronic F108—is characterized by bond energies of about 8k,7. When 
left undisturbed, the oppositely charged particles condense within 
hours from mixing, forming millimetre-sized crystalline solids ina 
matter of days. Particles with stronger attraction initially produce gels 
that can anneal over time into crystalline phases (Extended Data Fig. 2). 
Although the resultant crystal domains are smaller than the ideal case, 


a -PPO- 4 b 
‘ Ds \ Like charge 
F108 FA a a a a 
140+ oa a PS-sulfonat -amidi 
[ O O O E Pe te th L ° eau onate OPS-amidine | 
£ 120+ we & 500° s 35 a e oe eo A | 
5 F127 rat < Eo pH 10.5 © 
9 100 > &C O O OGPAA A ~,-1,000) ® _35| ° | 
a F68 a 79 a, 5% 
- oF @: O COM AA A eS -1,500 oo eae 10 12 2 ° Jee 
2 a a © Dispersed 5 Obi d 
L - | isperse: 
3 F38 Y @ Crystallized -2,000 - i kgi] e . lized | 
ae o | rystallized | 
4o- C¥ OGBrAA Se dace : © OO ooANN ‘ 
-2,500 || 59 pH 7.0 | A Aggregated § 
20t4 n n n n n n n (ieee L a i = rel 
0 1 2 3 4 5 6 7 2 3 4 5 6 7 
Ap (nm) 


Fig. 2| Tunable crystallization conditions. a, Experimental phase diagram 
showing the assembly behaviour of oppositely charged PS asa function of A, 
for different brush lengths. Polymer brushes are installed by grafting or 
physisorption of PEO-PPO-PEO triblock copolymers. The polymer 
architecture comprises an anchoring block (PPO) anda variable spacer block 
(PEO). Error bars (+1s.d.) of the data points are smaller than the marker size. 
The dotted lineis a linear fit between crystalline points. b, Assembly behaviour 
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of oppositely charged F108-PS colloids as a function of A, for different pH 
values. The colour gradient corresponds to the ionic bond energy F, calculated 
assuming a brush lengthZ =10 nm. The pH affects the surface potentials 
(W_and Y,, inset) of the particles, whichin turn set the amplitude of the 
electrostatic attraction. Error bars are +1s.d. Crystallization occurs alonga 
constant energy line (dotted) across a broad range of pH values. When the 
opposite charge is lost (here at pH 11.3) the suspension remains dispersed. 


0.5 mm 


Fig. 3 | Fixed crystals. a, Fixed ionic colloidal crystals can be safely dried and 
manipulated. Left, a collection of millimetre-sized iridescent monocrystalline 
solids embedded ina matching refractive index epoxy resin (Supplementary 
Video 4). Middle and right, optical (middle) and SEM (right) micrographs 
showing the characteristic rhombic dodecahedral habit of a fixed CsCl crystal. 
b-e, Electron microscopy analysis reveals the exact structure of aseries of 
ionic crystals formed at different B values, including CsCl (b), AIB, (c), KyCgo (d) 


this demonstrates that even when particles seem to aggregate chaoti- 
cally, they remain sterically stabilized and continue to reconfigure, 
albeit slowly. In addition to direct visual clues, the nucleation and 
growth of ionic solids can be followed in much greater detail by in situ 
confocal microscopy, which allows differentiation between positive 
and negative species using two different fluorescent dyes (Fig. 1b-d, 
centre). Molecular dynamics simulations (Methods) further support 
our simple interaction model by revealing assembly behaviour that 
closely matches the experimental observations (Fig. 1b-d, right, Sup- 
plementary Video 2). 

We perform crystallization experiments varying every ingredientin 
the system, including type of particle spacer, solvent conditions and 
building blocks. First, we explore the polymer-attenuated Coulombic 
self-assembly (PACS) energy landscape by systematically changing the 
length of the spacers. We select various polymer brush thicknesses 
through a series of Pluronic surfactants—namely F108, F127, F68 and 
F38, listed in decreasing length. For each spacer, we rationally select 
the screening length to reach £, of approximately 8k,7 based on calcula- 
tions from our model, and then fine-tune the experimental conditions 
until crystals nucleate. As expected, particles with shorter spacers 
require higher salt concentrations (that is, smaller values of Ap) to repro- 
duce the same self-assembly behaviour observed for longer spacers. 
We found a direct correlation between the spacer length and the values 
of A, that cause crystallization, such that a constant ratio between the 
two produces equivalent interaction strengths (Fig. 2a). This confirms 
the simple argument that well-defined separation between oppositely 
charged particles allows for tunable electrostatic assembly. The linear 
relationship is clear across the three longest polymers, whereas the 
shortest polymer F38 fails to produce crystals, which we attribute to 
displacement flocculation because of its weak anchoring. We further 
test the robustness of PACS by assembling crystals under different pH 
conditions, as this strongly affects the native charge of the colloids. 


and NaCl (e). Scale bars, 1 pm. f, SEM image showing rhombic dodecahedral 
CsCl crystallites. Scale bar, 3 um. g, Left, optical micrograph of needle-like AIB, 
crystals imaged through crossed polarizers. Scale bar, 100 pm. Middle, 
bright-field micrograph showing a growing AIB, needle. Scale bar, 5 um. Right, 
SEM image of fixed AIB, needles. Scale bar, 20 pm. h, SEM images of an 
exfoliated K,C,. bulk crystal. Scale bars, 10 pm. 


Figure 2b shows that as we move away from neutral pH, one particle 
type rapidly loses charge—either the positive in basic conditions or the 
negative in acidic conditions. As the charge vanishes, the amplitude 
of the electrostatic attraction decreases, causing the crystals to disas- 
semble. Increasing values of A, can compensate for the electrostatic 
decay, thus resulting in suspensions that crystallize at lower salt con- 
centrations. By applying this criterion, crystals assemble over abroad 
range of pH conditions, with limits that are set only by the isoelectric 
points of the particles, beyond which the opposite charge is lost and 
electrostatic attraction vanishes. Finally, we mix and match particles 
of different compositions to demonstrate that PACS is not limited to 
PS microspheres, but applies to virtually any water-based colloidal 
system. Silica, 3-(trimethoxysilyl) propyl methacrylate (TPM) and 
PS particles with sizes ranging from 200 nm to 2 pm all successfully 
crystallize. In particular, the versatility of the method is best illustrated 
by the formation of hybrid crystals that incorporate both solid and 
liquid components (Supplementary Video 3). 

The carefully balanced interactions that are required for colloidal 
self-assembly are often incompatible with any chemical or mechani- 
cal perturbation, requiring specific fixing mechanisms, ultimately 
limiting yield and scope”. By contrast, PACS interactions establish 
interparticle bonds witha convenient and general self-locking mecha- 
nism that bypasses this issue entirely. This distinctive feature is eas- 
ily understood by considering the effect of removing salt on the pair 
potential of the particles. Because of the simple relationship between 
A, and bond strength, crystals can be first assembled in salty water and 
then allowed to gradually harden by dilution or dialysis. In particular, 
we find that under deionized conditions, crystals become fixed solids, 
at which point they can be handled in solution and dried while retaining 
crystalline order. This process is irreversible, and crystals do not disas- 
semble if salt is added back to the suspension, indicating permanent 
binding due to van der Waals interactions. Through this self-locking 
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Fig. 4| Heterogeneous nucleation of crystals. a, CsCl crystals growing ona 
negatively charged glass substrate. From left to right: (100), (111) and (110) 
planes are imaged by SEM (top) and confocal microscopy (bottom). 
False-coloured white and red particles highlight positive and negative species, 
respectively. Scale bars, 1m. On top of the SEM images, the schematics 
illustrate each plane relative to the unit cell of the crystal. b, Each crystal plane 
templates the growth ofa macroscopic crystal with a characteristic shape. 
Crystals are captured by optical microscopy (top) and SEM (bottom). Scale 
bars, 1pm. Renderings show the three-dimensional arrangement of the 
colloidal ions. c, Optical microscopy image showing crystals growing ona 
negatively charged glass substrate and the relative abundance of eachcrystal 
shape. Negative substrates favour the nucleation of crystal planes witha high 
planar density of positive particles. Scale bar, 40 pm. The inset shows the 
regular shape of asingle crystal imaged by bright-field microscopy. d, An 
increased particle-substrate bond energy leads to the nucleation of only (100) 
planes. Left, bright-field micrograph of monodisperse pyramid-like crystals. 
Right, an SEM image of the fixed crystals, fully revealing their 
three-dimensional microstructure. Scale bars, 20 1m. 


mechanism, we are able to observe that macroscopic colloidal ionic sol- 
ids growas single crystals and develop characteristic habits that resem- 
ble those of their atomic counterparts (Fig. 3, Extended Data Fig. 3). 
Dried products can be transferred to new media, such as matching 
refractive index liquids or flexible epoxy resins (Fig. 3a, Supplemen- 
tary Video 4). 

The structure of an atomic crystal varies according to the size of the 
constituent ions. Similarly, the structure of our ionic colloidal crystals 
is determined by the size ratio (6) of the building blocks. For B between 
0.95 and 0.81 colloidal crystals isostructural to caesium chloride (CsCl) 
nucleate, which rapidly develop with a rhombic dodecahedral habit 
(Fig. 3b, f), composed of faces from the (110) plane. At lower £ values, 
the next structure observed occurs at 6 = 0.61, forming the aluminium 
diboride (AIB,) crystal structure, which develops with a characteris- 
tic needle-like habit”? (Fig. 3c, g). At B = 0.53, an exotic K,C,, phase is 
observed, which, when fractured, reveals large sheets of its (110) planes 
(Fig. 3d, h, Extended Data Fig. 4). Finally, 8 = 0.47 results in the nuclea- 
tion of crystals with a familiar NaCl structure (Fig. 3e). 

In addition to homogeneous nucleation from bulk suspensions, crys- 
tals can readily assemble via heterogeneous nucleation against charged 
substrates. This second self-assembly route allows us to bias the crystal 
growthalong specific crystallographic directions, effectively shaping 
the growing solids. This is illustrated in Fig. 4a (See also Extended Data 
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Fig. 5), in which CsCl planes are selected by a negatively charged glass 
substrate—namely the (100), (111) and (110) planes—in descending 
order of planar density of positive particles. Each plane templates the 
growth of CsCl crystals with a specific orientation, resulting in their 
speciation into three distinct crystal types with characteristic colours 
and shapes; yellow squares (100), green hexagons (111) and pink hexa- 
gons (110) (Fig. 4b). Specific colouration in the species arises from the 
uniformly oriented crystals scattering light at the same angle. Counting 
the species based on colour and shape allows rapid determination of 
the relative yields, where substantially more (100) planes are nucleated 
due to their higher density of positive particles, followed logically by 
the (111) and (110) planes. Substrate influence is more pronounced 
when the interaction strength is increased, leading to the formation 
of purely (100) planes. We conclude that differing crystalline pyramids 
can be selected to self-assemble by surface attraction in experiments 
and simulations (Fig. 4c, Extended Data Fig. 5). 
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Methods 


PACS model 
We estimate the repulsive potential V,in Fig. 1a using the Alexander-de 
Gennes polymer brush model” **: 
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where the PS radius ris measured by electron microscopy (/s¢y=250nm), 
the thickness of the F108 brush is taken equal to 10 nm (ref. ””) and the 
polymer surface density ois estimated as 0.09 poly(ethylene oxide) 
(PEO) chains per nm? (ref. *°). The attractive electrostatic potential 
V, is obtained by approximating the surface potentials of the particles 
(W, and Y_) with the measured zeta potentials (+39 mV and —59 mV, 
respectively). 


Colloidal model systems 

The principal model systems consist of PS particles synthe- 
sized via surfactant-free emulsion polymerization. For example, 
600-nm-diameter positive spheres were made from a single-step 
reaction comprising 500 ml of deionized water, 50 ml of styrene 
monomer (=99% from MilliporeSigma) and 0.5g of the radical ini- 
tiator 2,2’-azobis(2-methylpropionamidine) dihydrochloride (97% 
from MilliporeSigma). All the ingredients were mixed in a three-neck 
round-bottom flask, heated to 60 °C and stirred at 330 rpm under nitro- 
gen overnight. After the reaction had concluded, the particles were 
washed via repeated sedimentation and resuspension cycles. Control 
over the PS size was achieved by varying the monomer concentration, 
reaction temperature or reaction time. Negatively charged PS par- 
ticles were produced in the same fashion, replacing 2,2’-azobis(2-m 
ethylpropionamidine) dihydrochloride with an equivalent weight 
amount of potassium persulfate (=99% from MilliporeSigma). Fluo- 
rescent labelling of the PS systems was achieved by a swell-deswell 
method. In brief, fluorescent dyes dissolved in tetrahydrofuran (THF) 
were added to particles stabilized with Pluronic F108 (MilliporeSigma) 
toa final concentration of 30% v/v THF, then diluted by a factor of five 
before washing the particles via multiple sedimentation and resuspen- 
sion cycles to set them in pure water. The fluorescent dyes used were 
rhodamine-labelled aminostyrene and 7-nitrobenzo-2-oxa-1,3-diazole- 
2-(methylamino)ethanol. Rhodamine-labelled aminostyrene was 
prepared by adding 48 mg of rhodamine B isothiocyanate (mixed 
isomers from MilliporeSigma), 12 mg of aminostyrene (97% from Mil- 
liporeSigma) and 0.1 ml of tetramethylammonium hydroxide (25 wt% 
in water from MilliporeSigma) to 200-proof ethanol (10 ml), and stirring 
the mixture overnight, which was then stored at 5 °C. Monodispersed 
oil droplets used in the assembly of liquid—solid composite crystals 
(Supplementary Video 3) were prepared through the condensation of 
3-(trimethoxysilyl) propyl methacrylate (TPM, >98% from Millipore- 
Sigma), by adding 80 ul of ammonia (28 wt%) to 320 ml of deionized 
water, followed by the addition of 600 pl of TPM. This mixture was 
magnetically stirred for 1h, which resulted in the nucleation and growth 
of droplets of approximately 1 pm in diameter. Fluorescent labelling was 
achieved with rhodamine B isothiocyanate, coupled to 3-aminopropyl 
trimethoxysilane for covalent binding to the TPM network. The drop- 
lets can be used directly or solidified via a radical polymerization. To 
use the liquid droplets directly, the emulsion was gently centrifuged, 
the supernatant removed and the droplets resuspended in deionized 
water. This wash cycle was repeated three times. To solidify the TPM 
particles, 20 mg of azobisisobutyronitrile (98% from MilliporeSigma) 
was added to the suspensions and gently stirred for 5 min; then the 
suspension was left ina sealed container at 80 °C for 4h. The particles 
were then washed in the manner described above. 


Self-assembly of ionic colloidal crystals 

Positive and negative PS spheres were separately equilibrated inasolu- 
tion containing 0.03 mM of Pluronic F108 and the desired amount 
of NaCl, typically 3-5 mM. After 1h, the suspensions were rapidly mixed 
together while vortexing and then allowed to crystallize undisturbed 
in various containers (for example, glass vials, NMR tubes, Eppendorf 
centrifuge tubes or Petri dishes). Occasionally, samples were sealed 
in glass capillaries (VitroCom) and monitored over time via optical 
microscopy. Capillaries were pretreated with a hydrophobizing agent 
to facilitate the formation of a polymer brush when exposed to Pluronic 
solutions. The typical hydrophobization protocol consisted of a20-min 
exposure to trichloromethyl silane vapour inside of a moisture-free 
sealed chamber. 

Assembly experiments involving particles with different brush 
lengths were performed using PS with grafted polymers. While not 
necessary for the successful assembly of crystals, grafting ensures 
that particles maintain a fully saturated surface regardless of polymer 
solubility. This allows for a fair comparison between particles that carry 
different polymer spacers. Pluronic surfactants were permanently 
grafted to the surface of PS particles via a swell-deswell method”. The 
specific surfactants, namely, F108, F127 and F68, were added tothe PS 
suspensions at their critical micelle concentration (3.4 mM, 0.8 mM 
and 20 mM, respectively), then THF was added to the suspension toa 
final concentration of 50% v/v. The suspensions were left to equilibrate 
for 1h, and the THF was diluted to below 5% v/v before washing the 
particles via centrifugation and resuspension. A polymer solution at 
the critical micelle concentration was used for the first wash until the 
THF was reduced to below 1% v/v, then particles were finally set in pure 
water after three more wash cycles. Pluronic F38 was not grafted to the 
particles because it failed to produce crystals even when present in 
high concentrations (~3 mM). We believe that F38 is too small to serve 
as an effective spacer for our PS model system, so it was not further 
investigated. The assembly of polymer-grafted PS followed a similar 
protocol as forthe assembly of PS with physisorbed polymers. Typically, 
two PS suspensions (for example, 3 wt% 550-nm-diameter negative 
PS and 1.6 wt% 450-nm-diameter positive PS) with the same grafted 
polymer were equilibrated for 30 mininan excess of that polymer ata 
concentration of 80 pM and ata variety of salt concentrations based on 
the desired A,. After equilibration, the oppositely charged suspensions 
were rapidly mixed and immediately sealed in hydrophobized capillar- 
ies using hot wax. The capillaries were left undisturbed for 12 h, at which 
time the assembly behaviour was observed via optical microscopy. 

Our assembly strategy was successful with different types of poly- 
mers as long as they provided adequate spacing. The triblock copoly- 
mers described above have an A-B-A architecture, PEO-PPO-PEO, in 
which the central PPO block serves as the polymer anchor. We assem- 
bled ionic colloidal crystals, however, also using the polymer Brij s100, 
which has an A-B structure comprising a100-unit PEO tail andastearyl 
head group. Notably, crystallization occurs at an equivalent salt con- 
centration to Pluronic F127, which has equivalent length PEO chains, 
suggesting Brij forms a polymer shell of similar thickness. 


Fixing crystals 

Bulk samples have their supernatant slowly diluted with deionized 
water, avoiding large-scale flows and shear that would affect the sedi- 
mented crystal. The supernatant can be carefully exchanged, removing 
all the salt, at which point crystals are robust enough to be resuspended 
and dried. Crystals inside of sealed capillaries are fixed by submerg- 
ing the capillary in deionized water, and breaking the capillary’s ends, 
taking care to not disturb the product as muchas possible. The sample 
is allowed to equilibrate overnight, then removed to dry slowly for 
another day. The capillary can then be opened by scoring the sides with 
a glass cutter and cleaving off the top, exposing the fixed crystalline 
product. Epoxy resin (Norland 73) can be poured directly onto fixed 
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dried crystals. Bubbles can form as the resin permeates through the 
crystal, so the infusion was allowed to proceed for typically 15 min to 
allow air bubbles to evacuate. The resin was then polymerized through 
5 min of ultraviolet light exposure. 


Imaging and characterization 

Optical images and videos were acquired using a Leica DMI3000 
inverted microscope equipped with differential interference con- 
trast optics and a high-resolution camera (Hamamatsu ORCA Flash4.0 
sCMOS). Assembled crystals were typically imaged through crossed 
polarizers using a Nikon D5300 camera optically coupled to the micro- 
scope. Fixed crystals were imaged by electron microscopy using a MER- 
LIN (Carl Zeiss) field emission SEM. Fluorescent images were taken using 
aLeica SP8 confocal microscope. Zeta potentials were measured using 
a Malvern Zetasizer Nano ZS. 


Computer simulations 

Simulations were performed using HOOMD-blue v2.5 (compiled in 
single precision)”, using a single graphics processing unit. The pair 
interaction between two particles of types N (negative) and P (positive) 
with radius ry, and r, was defined by adding V, to the potentials given 
by equation (1), using HOOMD-blue’s tabulated potential option (with 
1,000 interpolation points between the close touching distance, ry+ Fp, 
andrN+r,+20A,). For the steric repulsion term, we usea brush length 
L=10nm, anda brush density o= 0.09 nm~. For the electrostatics, we 
used surface potentials ¥_=—-5O mV and YW, =+50 mV, a dielectric con- 
stant of 80 anda mixing rule for the pre-factor r= 2/(1/ry+ 1/). Unless 
otherwise stated, simulations were initiated as having 8,000 spherical 
particles ona simple cubic lattice such that the packing fraction is @, 
with random assignments of particle type. Values of @ of either 0.001 
or 0.003 tended to give a good amount of nucleation while still leaving 
enough free particles to assemble some crystals. Heterogeneous 
nucleation on acharged surface was modelled by adding an attractive 
Lennard-Jones wall for N (P) particles in the centre of the simulation 
box pointing up and down, with Lennard-Jones parameters o= 2ryp), 
and atunable e. For Extended Data Fig. 2e, we also added a repulsive 
potential to the other particle of type P (N) with the same parameters 


but where the potential was shifted up and cut off at its minimum 
distance r=2"opq). 


Data availability 


The data that support the findings of this study are available from the 
corresponding author on request. 
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Fixed & Dried 


Extended Data Fig. 1| Iridescent macroscopic single crystals. a, Single 
crystals with millimetre diameters visible on the bottom ofa10-cm Petri dish. 
Boundaries between single crystals are clear through the uniformly coloured 
regions of the Voronoi pattern that forms. b, The crystallinity of the bulk 
sedimentis visible through iridescence when left undisturbed. At5 mM of NaCl, 
the assembly is still reconfiguring and delicate. After fixing through dilution 
and slowly drying, the hardened sediment retains iridescence, denoting 
retention of crystalline order. In air versus water, iridescenceis muted duetoa 


Fixed & Dried 


large refractive index mismatch. c, Fixed crystals dispersed and freely floating 
in density matched water (Supplementary Video 4). Slow rotation of the 
crystals reveals their colouration’s angle dependence, displaying vibrant and 
uniformly coloured red, green, and blue iridescence as they rotate. d, SEM 
micrograph ofa single crystal, examined at high magnification at four 
locations across the surface of the crystal. At each site, the crystal displays the 
same crystallographic plane and angle, demonstrating that the whole crystal is 
in register. Scale bar, 100 pm. Scale bar of high magnification sites, 5 um. 
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24 hours 


Extended Data Fig. 2| Crystals annealing with strong electrostatic Images were taken at 1 min (a), 5 min (b) and 20 min (c).d, After 24 h, the sample 
attraction. a-c, A20-mintime lapse of particles calculated to havea potential has crystallized. e, Crystals produced in this manner display the (100) plane due 
well depth of 12 kT. Coagulation occurs immediately after mixing and tothe strong electrostatic attraction to the negatively charged substrate. Scale 


disordered heteroaggregates rapidly sediment. Although noorderispresent,a bars, 10m. 
small degree of reconfiguration can be observed on the single-particle level. 


Extended Data Fig. 3 | Crystal habits. a, SEM micrographs of crystal facets 
from arhombic dodecahedral CsCl ionic colloidal crystal. Scale bars, 10 pm. 

b, Left, SEM micrograph of small CsCl crystals with dodecahedral habit. Close 
inspection reveals sides comprised of the (110) plane. Scale bar, 10 pm. Right, 
optical micrograph of CsCl crystals. Scale bar, 100 pm.c, Left, SEM micrograph 


ofan AIB, needle. Close inspection reveals crystals growing witha railroad 
pattern characteristic of their (1010) plane. Scale bar, 3 pm. Right, optical 
micrograph of needles growing aligned in bunches and reveal the same colour, 
insets of red and greens depending on the angle of orientation. Scale bar, 

120 pum. 


Article 


ia 


Extended Data Fig. 4| Layered pattern of K,C,, crystals. SEM micrographs of a bulk K,C,o ionic colloidal crystal captured at increasing magnification. Cleavage 
of the crystals reveals sheets of the (110) plane. Scale bars, 5 um. 
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Extended Data Fig. 5 | Substrate charge influence on heterogeneous 
nucleation of crystals. a, Schematic of substrate charge influence, where the 
(100) plane of either charge grows onan oppositely charged substrate. Bottom 
left, (100) plane of positive (white) particles growing ona naturally negatively 
charged glass substrate. Bottom right, glass surface treated with aminosilane 
for positive charge nucleates a negative (100) plane. Scale bars, 6 pm. b, Bright 
field microscopy image showing a macroscopic crystal nucleated ona 
negatively charged substrate. The magnified inset reveals the characteristic 
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pattern of the (100) planein contact with the substrate. c, Image of faceted 
CsCl crystals that nucleate spontaneously in simulation, whereA,=4.5nm, 
ry=500 nm andr,=430 nm. d, Heterogeneous nucleation is observed whena 
model attractive surface is added under the same conditions as b. The 
attractive surface has a strength of e=5k,/ for positive particles. e, Structures 
heterogeneously assembled relative to these facets were observed with an 
attractive and repulsive strength of ¢=3k,7 and with an increased size of N 
particles such that the ratio of 0,/o, =1.24. 
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The May 2018 rift intrusion and eruption of Kilauea Volcano, Hawai‘i, represented one 
of its most extraordinary eruptive sequences in at least 200 years, yet the trigger 
mechanism remains elusive’. The event was preceded by several months of 


anomalously high precipitation. It has been proposed that rainfall can modulate 
shallow volcanic activity”, but it remains unknown whether it can have impacts at the 
greater depths associated with magma transport. Here we show that immediately 
before and during the eruption, infiltration of rainfall into Kilauea Volcano’s 
subsurface increased pore pressure at depths of 1to 3 kilometres by 0.1to1 
kilopascals, to its highest pressure in almost 50 years. We propose that weakening and 
mechanical failure of the edifice was driven by changes in pore pressure within the rift 
zone, prompting opportunistic dyke intrusion and ultimately facilitating the 
eruption. A precipitation-induced eruption trigger is consistent with the lack of 
precursory summit inflation, showing that this intrusion—unlike others—was not 
caused by the forceful intrusion of new magma into the rift zone. Moreover, statistical 
analysis of historic eruption occurrence suggests that rainfall patterns contribute 
substantially to the timing and frequency of Kilauea’s eruptions and intrusions. Thus, 
volcanic activity can be modulated by extreme rainfall triggering edifice rock 
failure—a factor that should be considered when assessing volcanic hazards. Notably, 
the increasingly extreme weather patterns associated with ongoing anthropogenic 
climate change could increase the potential for rainfall-triggered volcanic 
phenomena worldwide. 


Compelling evidence exists for seismicity generated by rainfall*>. 
Rainfall-induced stress changes at depth can promote fault initiation 
and reactivation—a mechanism invoked for numerous geological phe- 
nomena, including landslides®, silent slip events’ and remote triggering 
of earthquakes®. Rainfall can also interact with hot volcanic lava domes, 
causing gravitational dome collapse’, explosions” and the generation 
of lahars and other flow phenomena”. Explosive activity has even been 
linked to specific weather systems”*. These mechanisms probably influ- 
ence volcanic activity only in the upper tens or hundreds of metres”, 
prompting the suggestion that rainfall may only bea viable trigger for 
volcanic activity in the shallow subsurface”. The only studies to link 
precipitation to deeper processes consider hydrological loading and 
unloading of the edifice’*”’. The question as to whether and how rain- 
fall can directly induce deep magmatic activity remains unanswered. 

In 2018, coincident with prolonged and extreme rainfall, Kilauea 
underwent a complex, multistage eruption involving an extensive rift 
eruption and the collapse of the summit caldera. On 30 April a train 
of seismicity initiated along the East Rift Zone (ERZ) that was inter- 
preted as a downrift dyke intrusion’, ultimately breaking ground as a 
fissure eruption on3 May, and followed the next day by a magnitude-6.9 
earthquake. The summit exhibited numerous explosive eruptions and 
caldera collapse events that continued through to August; activity at 
the Lower ERZ was characterized by fissuring and lava fountaining. 


This represents one of Kilauea’s most remarkable eruptive sequences 
over the past two centuries, not least because the initiation mechanism 
remains equivocal’. We investigate several lines of evidence that sug- 
gest that anomalous rainfall weakened the edifice by instigating an 
impulsive pressure wave that propagated to depth and modified the 
local effective stress in the rift zone. We hypothesize that this in turn 
triggered the dyke intrusion and the eruptive aftermath. 

Dyke initiation from a magma chamber can be considered in terms 
of the static tensile failure criterion around a cavity. Mechanical failure 
and dyke initiation will be achieved once the magma overpressure, 5p, 
achieves a threshold value defined by”: 


5p, = K(egz— [p + 8p] + 7) (1) 


where the product of the rock density, 9, gravitational acceleration, g, 
and depth, z, is the lithostatic stress; p and 6p are the hydrostatic pore 
pressure and the pore-pressure change, respectively; and Tis the tensile 
strength of the host rock. In an elastic medium, the magma overpres- 
sure is proportional to the tangential stress at the chamber wall, and 
their ratio, x, isa function of the chamber geometry: for example, x=2 
for a spherical chamber within an infinite space”. Equation (1) high- 
lights that—all other things held equal—an increase in pore pressure 
will decrease the failure overpressure required to initiate chamber-wall 
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Fig. 1|Pre-eruption ground deformation of the study site. a, Vertical 
deformation map derived from Sentinel-1InSAR data (ascending track 124 and 
descending track 87) between December 2014 and June 2017. Inset, map 
showing the location of Kilauea Volcano (red triangle) and the HVNP rain gauge 
(H) within the Eastern Rift Zone. The yellow box outlines the 0.25° x 0.25° 
TRMM/GPM footprint used here. The Paradise Park well (P) and Hawai‘ian 
Beaches rain gauge (B) are also highlighted (see Extended Data Fig. 1). The 


failure and dyke initiation by the amount x6p. Increasing fluid pressure 
facilitates not only the initiation of a dyke but also its propagation, inthe 
sense that the fracture toughness (or energy) of the edifice material is 
dependent onthe ambient stress field'* and thus on the pore pressure. 
Effective stress (that is, lithostatic stress minus pore pressure) governs 
the failure stress of volcanic materials”. During and after heavy rainfall, 
animpulsive increase in groundwater volume will cause a perturbation 
in pore pressure at depth, increasing it transiently above the back- 
ground hydrostatic condition. Under unconfined, saturated conditions 
typical of basaltic systems, such pore-pressure transients are often 
reflected by water-level changes in nearby wells””: this is demonstrated 
in Extended Data Fig. 1, where we correlate rainfall with recorded well 
data froma site 15 km north of the rift zone. In theory—if the volume 
and rate of meteoric water infiltration are sufficient—mechanical fail- 
ure can be induced inthe vicinity of amagma chamber or dyke, inturn 
triggering intrusion or eruption. 


Interferometric synthetic-aperture radar (InSAR) data show that 
more than 0.3 m of uplift occurred at the summit between 2014 and 
mid-2017 (Fig. 1a), yet only around 0.01m of uplift was detected at the 
summit over the following 10 months (Fig. 1b). This temporal defor- 
mation pattern is confirmed by data from GPS stations proximal to 
Kilauea’s caldera (CRIM and AHUP), showing that moderate inflation 
occurred between 2010 and 2015, followed by a substantial increase in 
inflation rate until around June 2017 (Fig. 1c), after which inflation was 
negligible for the following 10 months. In the upper and middle ERZ, 
no substantial inflation was detected between 2010 and 2018 (Fig. 1b; 
GPS stations MKAI and KTPM in Fig. 1c). This implies that the inflation 
observed from mid-March’ reflects local, shallow processes, rather 
than wholesale pressurization of the system. The lack of precursory 
summit and rift-zone inflation suggests that the intrusion-eruption 
was not triggered by an influx of fresh magma from depth but that it 
was a passive intrusion, caused by extension and/or weakening of the 
rift zone”. 

In early 2018, the Hawai‘ian islands were subject to protracted, at 
times extreme, rainfall (Fig. 2). The maximum peak in the rainfall power 
spectrum occurs at 1 yr“, indicating substantial annual seasonality in 
rainfall over Kilauea Volcano on Hawai'i Island (generally, most rainfall 
occurs between 9 March and 25 August), overlain by anon-negligible 
stochastic component (Fig. 2b). This annual signal accounts for over 
half of the variability in rainfall over Kilauea; however, an aseasonal 
shift of the synoptic-scale atmospheric wave pattern across the North 
Pacific in mid-to-late January 2018 preceded the passage of several 
consecutive low-pressure systems over Hawai‘iin the months to follow. 
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dashed box indicates the region shownina, b. Inb, the filled squareisa 
reference point; in a,b, opensymbols represent GPS station locations. 

b, Vertical deformation between June 2017 and May 2018.c, Vertical GPS 
displacement of the Kilauea flank from 2010. Station names are shown on the 
panel; colours correspond to locations ina, b. Dashed vertical lines highlight 
marked changes-of-slope in station data proximal to the summit (for CRIM and 
AHUP). Datasets have been offset from the x axis for clarity. 


Several months of greater than average rainfall culminated in record 
downpour, with 1.26 m of rain falling within 24 h (14-15 April 2018) on 
Kauai Island (northwest of Hawai‘ilsland)—a record not only for Hawai‘ 
but for the entire United States”*. Calibrated precipitation data from 
NASA’s Tropical Rainfall Measurement Mission and Global Precipitation 
Measurement mission (TRMM/GPM) satellites indicate that over 2.25m 
of rainfall fell over Kilauea during the first quarter of 2018, compared 
with a first-quarter 19-year average of 0.90 m. Figure 2c, e shows the 
multiday cumulative sum of rainfall calculated as a moving window 
across the time series, using windows of 30 days (Fig. 2c, approximately 
1 month) and 180 days (Fig. 2e, approximately 6 months). These data 
are lognormal (Fig. 2d, f). Notably, the 30-day total rainfall exceeds 2 
standard deviations of the mean (+2¢) immediately before the 2018 
flank eruption: that is, a statistically significant deviation. Even more 
strikingly, the rolling 180-day cumulative rainfall has only two periods 
in which rainfall exceeds this threshold, one of which directly precedes 
the eruption. 

Kilauea Volcano is hydrogeologically complex”, as highlighted by 
laboratory data (for example, see ref. >; see Methods). Accordingly, we 
model the edifice as two connected saturated domains ina one-dimen- 
sional half-space (model a): a highly permeable shallow layer (0-500 m) 
overlying an intermediate-permeability domain (0.5-10 km). Propa- 
gation of pore pressure p from the surface owing to precipitation is 
modelled using a finite difference approximation to solve for transient, 
vertical flow of groundwater (thatis, the diffusion problem), such that 
(Op/0t) = D(0"p/dz’), where zis depth, tis time and Dis hydraulic dif- 
fusivity (a function of permeability, bulk modulus, fluid viscosity and 
porosity). Full details, including parameter values, are given in Meth- 
ods. For model a, D=37 m’s ‘and 0.34 m’s ‘above and below 0.5 km 
(below the surface), respectively (Fig. 3a). Using the calibrated satel- 
lite data as a variable-flux boundary, we show the resultant maximum 
pore-pressure change in Fig. 3b. Three additional models are shown 
for reference to demonstrate the range of feasible pressure responses 
to rainfall (see Methods and Extended Data Table 1). 

In all modelled scenarios, we observe a quasistatic pore-pressure 
build-up of tens to thousands of pascals at depths 1-6 km below the 
surface immediately before the onset of the 2018 flank eruption. For 
completeness, we include results from the unlikely end-member sce- 
narios in Extended Data Fig. 2. More complex and realistic models yield 
intermediate pressure changes (Fig. 3b and Extended Data Fig. 2), as 
demonstrated in Fig. 4a—c. The April 2018 peak in pore pressure is the 
highest observed throughout the modelled period (for example, see 
Fig. 4b, c). Parametric analysis of each of the model frameworks shown 
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Fig. 2 | Rainfall over Kilauea. a, Calibrated daily rainfall amount over Kilauea 
from March 2000 toJuly 2019 (TRMM/GPM data calibrated with gauge data; 
see Fig. lainset and Methods). The vertical bar shows the date of flank 
eruption (E). b, Power spectrum of Fourier-transformed rainfall time series (a) 
shown in the (normalized) power-frequency domain. c, Rolling 30-day 


in Fig. 3a reveals that this pattern holds for a wide range of feasible 
physical properties. In our preferred model (model «), we obtain pres- 
sure changes of around 0.1 kPa and 1 kPa at 3 km and 1km below the 
surface, respectively. 

At depths of around 3 km belowthe surface (or 1.8 km belowsea level 
(b.s.I.), the depth estimated for most lateral magma transport in the 
rift zone’), the elevated subsurface pore pressure in early 2018 was the 
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Fig. 3 | Diffusion model metadata. a, Diffusivity (a function of permeability, k, 
bulk modulus, K, fluid viscosity, 1, and porosity, @, given by kK/u@) against 
depth (depth below the surface and depth b.s.1. are shown for reference) for 
four different models (k— @ scenarios, summarized in Extended Data Table 1). 
The primary model (model a) is atwo-layer model, witha zone of high porosity 
and permeability overlying a lower permeability domain. Reference model Q, 
subdivides the lower domain into two, each with different properties. Models 
Q, and QO, are single-domain models, meaning that they use values of porosity, 
permeability, bulk modulus and viscosity that are constant with depth. 

See Methods for more details. b, The maximum pressure change effected at 
depth in each model throughout the modelled time series. Note that the 
maximum pressure change for any given depthcan occur on numerous dates 
and may not be consecutive. For example, high values near the surface may 
occur frequently (see Fig. 4a, c) but may not propagate to depth. Onthe other 
hand, high pressure at depth may not bea direct consequence of asingle spike 
at the surface. 


500 
Count 


1,000 


cumulative rainfall since March 2000 at Kilauea. Horizontal dashed lines 
delineate two standard deviations either side of the mean (x+¢).d, Histogram 
of datainc on log-linear axes. e, Rolling 180-day cumulative rainfall. 

f, Histogram of data frome on log-linear axes. 


largest peak in pressure in 47 years—the highest since the onset of the 
Pu‘u ‘06 eruption (Fig. 4b, c). Given the mechanism described above, 
this is a strong indicator that elevated pore fluid pressures facilitated 
the 2018 intrusion and rift eruption. Moreover, we outline four inde- 
pendent lines of evidence to support this proposition. 

First, there was very little precursory inflation immediately before 
the rift eruption (Fig. 1b, c). The rapid uplift recorded by tiltmeters in 
the Pu‘u ‘0 area from mid-March reported in ref. ‘thus reflects a highly 
localized, shallow deformation source. The absence of widespread pre- 
cursory inflation suggests that the 2018 intrusion was passive, fostered 
by weakening of the rift zone. 

Second, a statistical analysis of the Kilauea’s reported historical 
eruptions shows that the volcano exhibits a marked tendency towards 
erupting during the wettest times of the year (Extended Data Fig. 3): 
the onset of around 60% of reported eruptions since 1790 (including 
the Pu’u ‘O’6 eruption 1983-2018) occurred during the ‘rainy’ season, 
despite the fact that Kilauea’s ‘rainy’ season is shorter than its ‘dry’ 
season (see Methods). 

Third, recorded intrusions appear to be correlated with elevated 
pore pressures at depth. Figure 4b shows intrusions into the rift zone 
since the 1975 Kalapana earthquake (compiled after refs. 77° °). Com- 
paring the modelled rainfall-induced pressure perturbation at a depth 
of 3 km below the surface with the long-term average, more than 60% 
of intrusions (20 of 33) are associated with periods of above-average 
pore pressure. If we compare the pore-pressure change with the rolling 
four-year average at the same depth in order to account for interan- 
nual fluctuations (such as the El Nifio-Southern Oscillation) we find 
that 19 (58%) were initiated when the pore pressure was above this 
threshold. Clearly, not every peak in pore pressure is associated with 
an intrusion, and vice versa—a function of superposing processes with 
different periodicities (a combination of internal and external forcing), 
and thus characteristic of triggered systems (see, for example, ref. “). 
Nevertheless, it is striking that intrusions are approximately twice as 
likely to occur at Kilauea when pore pressureis elevated, suggesting that 
pore pressure can act as a trigger mechanism in a critically stressed—or 
‘primed’—volcanic system. 

Finally, historical precipitation records show that Kilauea’s May 
1924 eruption—the previous extraordinary eruption—also followed 
extremely wet conditions. In April 1924, many stations across Hawai‘i 
recorded as muchas 0.5 min excess of the long-term average for that 
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Fig. 4| Pore-pressure change in response to infiltration into Kilauea’s 
edifice. a, Pore-pressure change modelled over the period January 1975 to April 
2019, using available HVNP gauge data (1950-2015) and calibrated satellite data 
(2000-2019), with depth and time. Daily rainfall data are used as a fluctuating 
boundary condition. Colour scale indicates pressure change. b, Pore-pressure 
change at 3 km below the surface (1.8 kmb.s.I.) modelled over the period 
January 1950 to April 2019 (data shown are since the 1975 Kalapana earthquake). 
The dashed line shows the four-year running average. K represents the 1975 
magnitude-7.2 Kalapana earthquake; P shows the 1983 onset of the Pu‘u‘O‘O 
eruption; E represents the 2018 Kilauea rift intrusion—eruption. Vertical bars 
show reported intrusion events within the rift zone, after refs. ”?°**, Intrusions 


month, and the gauge measurements for several of these were at that 
time the highest daily rainfall amounts on record”. In particular, more 
than 0.1 m fell at the Hawai‘i Volcano Observatory in 24 h (14 April). 
The 2018 eruption echoed many features of 1924 (for example, major 
summit explosions and a drop in lava lake level), suggesting that not 
only the timing of intrusions and eruptions but also the eruptive style 
of Kilauea is influenced by rainfall. 

Diffusion modelling shows that rainfall can induce quasistatic 
pressure changes on the order of kilopascals to tens of kilopascals at 
depths of a few kilometres (Fig. 4 and Extended Data Fig. 2, respec- 
tively). Increasing pore pressure can cause embrittlement”, hydro- 
fracture and dyke initiation” by reducing the static threshold for 
tensile failure. Although the precise magnitude of stress change will 
vary owing to drainage and the thermal contribution of the magma, 
pressure changes on the order of 1kPa are in line with trigger stresses 
caused by solid Earth tides? * and those required to trigger earth- 
quakes on pre-existing faults**. Pressure changes on the order of 10 kPa 
are typically assumed to be necessary to trigger mechanical failure and 
attendant geological processes in unstressed media‘; however, it has 
been demonstrated that stress changes in the range of 0.1-1 kPa are 
sufficient for rainfall to trigger earthquakes provided that the crustis 
inacritical state>. Ithas been inferred from historical eruption patterns 
that Kilauea is particularly sensitive to external modulation and erup- 
tion triggering”®; moreover, recent evidence shows that the rift zone 
has undergone mechanical weakening over the course of the Pu‘u ‘OO 
eruption”. Although we do not account here for potential amplifica- 
tion or dampening of the precipitation-induced pressure change in 
response to poroelastic effects, our results show that the quasistatic 
stress changes associated with pore-pressure infiltration can effect a 
sustained increase above long-term background levels in a way that 
dynamic stresses of similar magnitude (for example, solid Earth tides) 
cannot. Relatively small changes in edifice pore pressure can result ina 
substantial reduction in the overpressure required to instigate magma 
chamber failure (Equation (1)). As such, precipitation-induced pore- 
pressure fluctuations contribute to the overall stress state of Kilauea 
Volcano; we propose that this hydromechanical coupling may directly 
trigger primary volcanic activity. 

The unprecedented rainfall over Hawai‘i in the months before the 
2018 flank eruption increased the potential for mechanical failure 
within the edifice. Taken together, the separate lines of evidence 
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are highlighted in yellow if they coincide with periods during which pressure 
change exceeds the four-year average, and grey if they do not. Intrusion 33 in 
this time series corresponds to the early 2018 activity (intrusion detected mid- 
March, followed by the rift eruption on 3 May). The arrow highlights the 
maximum pore-pressure perturbation over this timeframe (1975 to 2019), 
coinciding with the onset of 2018’s rift eruption. Horizontal bars indicate data 
availability. c, Probability density function (PDF) of modelled pressure change 
at depths 1-6 km belowthe surface. Arrows highlight the pore-pressure front 
diffusing from near the surface (1km) to greater depths over time (26 March, 4 
April, 23 April, 22 May, 22June and 4 July 2018 at1km,2km,3km,4km,5kmand 
6kmbelowsurface). 


reported above strongly suggest a correlation between rainfall and vol- 
canic activity at Kilauea—not only in 2018, but throughout its eruptive 
history. By locally reducing effective stress at depth, prolonged peri- 
ods of rainfall may induce opportunistic dyke intrusions or facilitate 
dyke propagation. The historical preponderance of Kilauea’s eruptions 
during the wettest parts of the year buttresses this theory, as does the 
coincidence of dyke intrusions with elevated subsurface pore pressure 
and the similarities observed between volcanic events associated with 
similar rainfall patterns. Critically, as our climate continues to change, 
the occurrence of prolonged periods of extreme rainfall is predicted to 
increase in many parts of the world, increasing the potential for rainfall- 
triggered volcanic phenomena. Elevated pore pressures at depth tend 
to be fostered and maintained by prolonged periods of above-average 
rainfall, associated with long-lived and generally forecastable synoptic- 
scale systems: by better understanding the hydromechanical couplings 
between rainfall and volcanism, advanced warning of rainfall-induced 
volcanic hazards may be achievable. 
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Methods 


Rainfall data 

To obtain a contiguous rainfall time series at Kilauea, we extract pre- 
cipitation data for the period March 2000 to June 2019 from the NASA/ 
JAXA Tropical Rainfall Measuring Mission (TRMM) and Global Pre- 
cipitation Measurement (GPM) satellite datasets, which are operation- 
ally available from NASA from March 2000. We use the 3B42 Research 
Version product (https://doi.org/10.5067/TRMM/TMPA/3H/7), a 
0.25 x 0.25° gridded daily product resampled from the original three- 
hourly rainfall estimates. Although this provides a daily rainfall estimate 
over the entire time frame, these data are an areal average and under- 
represent the true volume of rainfall at Kilauea by a factor, A, of 5.12. 
We calibrate the satellite data using rain gauge data available between 
1950 and 2015 (gauge H in Fig. 1a, inset). The gauge data are from the 
Hawai‘i Volcano National Park rainfall gauge (Global Historical Clima- 
tological Network Daily (GHCND) identification code USC00511303), 
available through the Climate Data Online project of the National Oce- 
anic and Atmospheric Administration (NOAA). This gauge site was a 
volunteer observer station which closed as of 22 June 2015. 


Ground deformation 

Vertical velocity data were obtained by processing Sentinel-1 ascending 
and descending SAR data. Data were processed using the US Jet Propul- 
sion Laboratory (JPL)-developed InSAR Scientific Computing Environ- 
ment (ISCE) open-source software package’, and further time-series 
analysis was performed using the MintPy software toolbox (Miami 
INsar Time-series software in PYthon), developed at the University 
of Miami”. Vertical GPS data (Fig. 1c) were accessed from the Nevada 
Geodetic Laboratory for GPS stations CRIM, AHUP, KTPM and MKAI 
(Fig. 1a, b; http://geodesy.unr.edu/NGLStationPages/stations/; ref. *°). 
Vertical data are extracted directly with associated vertical error. 


Modelling approach 

We model the subsurface pore-pressure perturbation as a function of 
rainfall by using a finite difference approximation to solve for a tran- 
sient, vertical flow of groundwater (that is, the diffusion problem), 
such that (0p/dt) = [kK/u@]0°p/dz’]. Here, pore pressure, p, over time, 
t,and depth, z, beneath the surface is a function of the permeability, k, 
the bulk modulus, K, and the porosity, @, of the edifice, as well as the 
viscosity, y, of the percolating fluid (water). The expression kK/u@ 
represents the (hydraulic) diffusion coefficient. Assuming a zero-flux 
boundary, we impose a pressure change of zero at the base, Z, of the 
domain, which is arbitrarily given as 10 km below the surface (8.8 km 
b.s.I.), that is, (0p/0z)|,_7~ 5p|,-7= 0. We note that this is deep enough 
that we do not observe boundary effects in the depth range of inter- 
est. Although subsurface pressure data for calibration are scarce, this 
solution appears to reflect pressure changes at depth well (Extended 
Data Fig. 1), as it relies on physical parameters. 

Pressure at the surface is defined given the calibrated height of 
recorded rainfall, h, the density of water, e,, and the acceleration dueto 
gravity, g, such that p(z=0, t) =Ae,gh(t), where A =5.12 is the calibration 
factor described above, and z represents the depth below the ground 
surface. To test the sensitivity of our model to the rainfall input, we 
also ran it assuming daily rainfall values of +10% of the recorded value: 
we note that the relative timing and magnitude of pressure evolution 
varies negligibly as a result. 

The uppermost 500 mor so of the Kilauea volcano comprises some 
of the planet’s most permeable known geological materials (with per- 
meabilities on the order of 10 m’, on the basis of laboratory meas- 
urements” and simulations”). Vertical permeability, k,, however, is as 
muchas three orders of magnitude lower than horizontal permeability, 
k, (refs.***”)—a consequence of surface-parallel layering anisotropy®. 
Below the water table, equivalent (bulk) permeability is anticipated 
to be lower still. Modelling and mud-loss permeabilities“ suggest 


that k, isin the range 1x 10 m’ to 6 x 10“ m’, with k, estimated to be 
perhaps a factor of 10 or 100 lower, but not less than around 10° m? 
(ref. *°). This is greater than laboratory measurements in this interval, 
partially because large-scale fractures are not encompassed by such 
measurements”, and partially because sample recovery of friable, 
high-permeability materials is inherently low”. Reference” reports 
modal porosity values of edifice-forming basalt at Kilauea of 0.15-0.3. 
This range is in agreement with a wealth of experimental data*® for 
the typical porosity of a volcanic edifice. Thanks to these studies, we 
have a reasonable site-specific estimate of the permeable architecture 
beneath Kilauea. 

Deformation experiments on edifice-forming volcanic materials” 
reveal typical values of pore compressibility (, the inverse of the bulk 
modulus) on the order 10°” Pa”. Accordingly, we assume a value of 
K=B"'=10 GPa throughout this study. Interstitial fluid (water) viscos- 
ity, p1, is similarly assumed to be constant (8.9 x 10* Pas“). Perme- 
ability and porosity (the remaining parameters governing rainwater 
percolation through the edifice) are neither constant with depth nor 
independent of each other, however. Our preferred model (model a) 
comprises two domains ina one-dimensional half-space, the uppermost 
of which is highly permeable and porous (k=1x 10" m’and @=0.3). The 
underlying portion, deeper than 500 m, is less permeable (6 x 10°» m?) 
and less porous (@ = 0.2). For convenience, these data are shown in 
Extended Data Table 1. The results of three different additional mod- 
els are shown in Fig. 3. The first is a three-section model (here called 
model Q,), which assumes that the upper 500 m of the edifice overlies 
intermediate- and low-permeability domains. Model ©, and model 
Q, are both homogeneous equivalent permeability models (that is, a 
single value of permeability is used throughout the domain), based on 
the parameters of model Q,. Model O, reflects the arithmetic average 
permeability of the domains in model O, (k,=8.3 x10” m’), while model 
Q, uses the geometric average permeability of the domains in model 
Q,: k= 5.4 x 10 m’. Values are given in Extended Data Table 1. Note 
that these latter models represent extreme and unlikely end-members 
and are shown here only for completeness. 

The homogenous models (Q, and Q3) essentially represent upper 
and lower bounds of the pressure change at depth, even though the 
heterogeneous models may contain domains of lower equivalent per- 
meability. Model ©, exhibits very little pressure attenuation of pressure 
with depth and time because of the high permeability value assumed, 
and may be representative of an extensively fractured edifice in a self- 
organized critical state**, which gives rise to hydraulic continuity from 
the surface to depths of several kilometres (that is, close to the pressure 
response of a theoretical kilometres-long vertical crack). Despite the 
rapid attenuation of model Q, (based on the geometric average per- 
meability), we highlight that a pressure perturbation of over 0.1 kPa 
is still observed at 3 km depth. The geometric average is thought to 
be areasonable compromise between direction-specific averaging 
approaches where the precise geometry and orientation of an aniso- 
tropic medium is unknown’, and so represents an approximation ofa 
series of randomly oriented unfractured geological units. Relative to 
these homogenous scenarios, the heterogeneous models (a and Q,) 
yield intermediate deviations in pressure, and comparing these models 
with the limited well-level data available indicate that they capture the 
evolution of subsurface fluid pressure in our study area (Extended Data 
Fig. 1). Inall modelled scenarios we observe a build-up of pore pressure 
of tens to thousands of pascals at a depth of 3 km immediately before 
the onset of the 2018 rift eruption. Exploring the (k, @, z) parameter 
space for each of the model frameworks shown in Fig. 3a reveals that 
this pattern holds for a wide range of feasible physical properties ina 
depth interval generally assumed to accommodate magma transport 
between Kilauea’s relatively shallow magma source and the surface. In 
particular, this April 2018 peak is observed when the diffusivity coef- 
ficient falls within the range 0.1-1.0 for the majority of the edifice, a 
range in agreement with other studies that investigate fluid percolation 


through volcanic media*****° and corresponding to a wide range of 


physically tractable combinations of k and @. 


Observed well-level data 

There is a paucity of available subsurface pore-pressure data for Kilauea. 
Reference® shows pore-pressure data collected for a few weeks in 2001: 
inthis case, the time series is too short and the signal from a coincident 
magmatic intrusion too strong to detect the input of meteoric water. 
In earlier work, pressure or head data are reported for anumber of 
wells throughout the region around Kilauea, but are generally heavily 
modulated by ocean tides or located far from the rift zone. Despite 
these known issues, we show atime series of well-level change data 
digitized from ref.*, in order to highlight that a diffusion-based mod- 
elling approach is appropriate for describing groundwater evolution 
within the East Rift Zone (see also refs. °°”). The head data are derived 
from aquifer tests performed in the early 1990s at the Paradise Park 
well (located at longitude -154.976, latitude 19.596; state well number 
3588-01; see Fig. la inset), located approximately 15 km north of the rift 
zone, for which there is anear-continuous record between October 1992 
and September 1993. Reference” highlights that the wellislocatedina 
portion of the aquifer that effectively dampens the tidal signal, meaning 
that the primary influence recorded ought to be that of the rainfall. We 
model pore-pressure evolution using rainfall as an input, as described 
inthe ‘Modelling approach’ section above. In this case, we use data from 
the Hawai‘ian Beaches rain gauge (approximately 9 km southeast from 
the Paradise Park well; see Fig. lainset), available between September 
1992 and August 2005S. For simplicity’s sake, we use the same input 
parameters as determined for Kilauea Volcano itself (based on collated 
experimental, modelling and drilling data), although we acknowledge 
that the subsurface structure at Paradise Park may differ somewhat. 
Because our diffusion model solely comprises physical, measurable 
parameters (permeability, porosity and so on), there should be no 
need—in theory—to tune it empirically, provided that our knowledge 
of the subsurface physical properties is sufficient. 

Extended Data Fig. 1b shows the well-level change data”, alongside 
the pore-pressure change data at 1 km depth modelled here on the 
basis of rainfall records from the Hawai‘i Beaches gauge (Extended 
Data Fig. 1a; see Fig. 1a inset for location). Although some discrepancies 
remain, clearly the modelling approach used here reproduces much of 
the subsurface pressure response resulting from rainfall infiltration (in 
particular the timing and relative magnitude of peaks, as highlighted by 
the grey bars). As such, we consider this model appropriate to describe 
subsurface pore-pressure evolution within the rift zone without empiri- 
cally varying the diffusivity parameter, D. 


Additional model results 

Four models are described in the main text, with results shown from 
our preferred model, a. For reference, results from the three additional 
models are shown in Extended Data Fig. 2. 

The parameters of model Q, are based on previous experimental, 
numerical and in situ drilling data (see the ‘Modelling approach’ sec- 
tion above). Note that the maximum pore pressure occurs immediately 
before the 2018 fissure eruption (Extended Data Fig. 2). Based on model 
Q,, 64% of intrusions (21 out of 33) occur when pore pressure is above 
the four-year average. Models ©, and Q; represent theoretical end- 
member values for hydraulic diffusivity within the East Rift Zone, and 
donotnecessarily reflect realistic pressure-evolution scenarios. Twelve 
per cent of intrusions (4 of 33) occur when pore pressure is above the 
four-year average for Q,. Fifty-five per cent of intrusions (18 of 33) occur 
when pore pressure is above the four-year average for Q3. 


Binomial probability analysis of eruption record 

Fourier analysis of the satellite-derived rainfall time series (see, for 
example, Fig. 2) reveals that rain falls over Kilauea predominantly 
between 9 March and 25 August, which we define as the ‘wet’ season. 


This period covers 46% of the year and accumulates >64% of annual 
rainfall. We analyse Kilauea’s eruption record (based on the onset 
date of eruptions defined in the Smithsonian Institution’s Global Vol- 
canism Program eruption database: https://volcano.si.edu/volcano. 
cfm?vn=332010). Note that this includes the onset of the 1983 Pu‘u 
‘OO eruption. If eruptions were randomly (or uniformly) distributed 
throughout the year, we would anticipate them to occur during the ‘wet’ 
season with a probability of around 0.46. However, the ratio of ‘wet’- 
season to ‘dry’-season historical eruptions is more than 0.58. We can 
assess the prevalence of eruptions occurring during the ‘wet’ season 
by using binomial probability analysis based on the modelled time- 
series and Kilauea’s eruption record. The probability is calculated by: 


g(x) =[n!/x!(n-x) !]ge*(1- gy)" * 


where go(x) is the probability of ‘successes’ out of n trials; g is the prob- 
ability of success of a given trial (we define a ‘success’ as an eruption 
that occurs within the predefined ‘rainy’ season for Kilauea); andn 
corresponds to the total number of observed eruptions in our dataset. 
This analysis allows us to determine the probability of the observed 
ratio of ‘wet’-season eruptions to ‘dry’-season eruptions occurring 
fortuitously. The mean and standard deviation of this distribution are 
given by x = ng andc¢= ./xX(1- go), respectively. 

The observed value (Extended Data Fig. 3) is just within the conven- 
tional 2¢ range, which suggests that it could be a fortuitous distribution 
(that is, many more eruptions could have occurred during the ‘wet’ 
season by chance). Although this result in isolation precludes strong 
statistical inferences from being drawn, we highlight that the observed 
trend for eruptions to occur predominantly in the ‘wet’ season mirrors 
the prevalence of dyke intrusions occurring during periods of elevated 
subsurface pore pressure (approximately 60% of historical eruptions 
and recent dykes are associated with rainfall). We repeat the same analy- 
sis using only eruptions defined by the Global Volcanism Program as 
having an explosivity index value (VEI) of two or greater. Although 
there are too few reported larger eruptions to be statistically robust, 
we highlight that, again, the number of eruptions is significantly higher 
(to the one-sigma level) during the ‘wet’ season (four out of five VEI 2+ 
eruptions occurred during the ‘wet’ season; see Extended Data Fig. 3). 


Data availability 

Satellite-derived rainfall data (TRMM and GPM satellite data) are avail- 
able from NASA’s EarthData GES DISC portal (https://doi.org/10.5067/ 
TRMM/TMPA/3H/7). Rainfall gauge data are available from the NOAA's 
National Centers for Environmental Information climate data portal 
(https://www.ncdc.noaa.gov/cdo-web/datasets/GHCND/stations/ 
GHCND:USC00511303/detail). Vertical GPS data are available from 
the Nevada Geodetic Laboratory (http://geodesy.unr.edu/NGLStation- 
Pages/stations/; stations CRIM, AHUP, MKAI and KTPM). Additional 
datasets generated here are available from the corresponding author 
onreasonable request. Sentinel-1 ascending- and descending-track SAR 
acquisitions were obtained through Unavco’s Seamless SAR Archive 
(https://github.com/bakerunavco/SSARA). Vertical displacement 
(velocity) maps of Kilauea for the time periods 2014-2017 and 2018 
are available at https://doi.org/10.5281/zenodo.3459589, alongside 
the Shuttle Radar Tomography Mission (SRTM) digital elevation model 
used for plotting data. 


Code availability 

An archived version of the code required for data access, analysis 
and display is available at https://doi.org/10.5281/zenodo0.3635944. 
Python code is in aJupyter Notebook. Version updates, if applicable, 
will be made available via GitHub: https://github.com/jifarquharson/ 
Farquharson_Amelung 2020 Kilauea-Nature and https://github.com/ 
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Extended Data Fig. 1| Comparison of recorded head change and modelled 
pressure change. a, Rainfall data obtained from the Hawai‘ian Beaches rain 
gauge (refer to Fig. 1a inset for location). b, Data from the Paradise Park well (see 
Fig. 1a inset for the location of the well), digitized from ref. *! over the time 
period October 1992-June 1993. c, Pressure evolution ata depthoflkm 
modelled using rain data froma. Grey bars highlight peaks evident in well-level 
data that are echoed in the modelled data on pore-pressure change. Note that 
well level serves as a proxy for pressure change, dependent on well depth and 
bore, inertia, storage capacity, tidal effects and atmospheric pressure: these 
factors are not considered in this illustrative example. 
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Extended Data Fig. 2 | Diffusion model results. a, Pore-pressure change at 
3km below the surface (1.8 kmb.s.1.) modelled over the periodJanuary 1950 to 
April 2019 for model Q, (data shown since the 1975 Kalapana earthquake). Data 
modelled using gauge data are shown in red; data modelled using satellite data 
are in blue. Four-year averages (dashed lines) are also shown. Vertical bars 
indicate reported intrusion events within the rift zone. K shows the Kalapana 
earthquake; P represents the onset of the Pu‘u ‘O06 eruption; E highlights the 
2018 fissure eruption. The arrow indicates the highest modelled pressure 
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change.b, As for a, but for model Q,, a theoretical high-diffusivity end-member 
scenario. c, As for a, but for model Q, a theoretical low-diffusivity end-member 
scenario. d, PDF of modelled pressure change at depths 1-6 km belowthe 
surface from model Q,. Arrows highlight the pore-pressure front diffusing 
from near the surface (1km) to greater depths over time (months and dates are 
shown). e, As for d, but for model Q, (pressure maxima not shown). f, As ford, 
but for model Q, (pressure maxima not shown). 
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Extended Data Fig. 3 | Predicted binomial distribution of ‘wet’ season highlighted, with a probability of 0.04. Data are also shown for historical 


eruptions at Kilauea. The anticipated means, X,and standard deviations,¢,are eruptions of VEI2 and greater. 
shown. The observed number of historical ‘wet’ season eruptions (36) is 
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Extended Data Table 1| Parameters of models shown in Fig. 3 


Layer Property ee 
a Q; Q> Q3 
k [m?] Loxt0-* 1210x107" $3x107 5.4%107% 
, @ 0.3 0.2 0.2 0.2 
z [km] 0-0.5 0-0.5 0-10 0-10 
Diffusivity [m?s~'] 3.75x10! 5.62x10° 468x107 3.04x 10-7 
k [m?] 6.0x10715 1.0x 107!4 - - 
r p 0.2 0.2 - - 
z [km] 0.5-10 0.5-1.2 . - 
Diffusivity [m?s~!] 3.37x 107? 5.62 x 107! - - 
k [m?] - 1.0 x 10716 - - 
53 o - 0.01 - - 
z{km] - 1.2-10 - . 
Diffusivity [m? s~!] - 1.12 x 107} - . 


Models a and O,., are divided into up to up to three segments, each with a permeability, k, porosity, @, and depth range, z (depth below the surface). Other parameters (K, 7) are kept constant 
across all models. 
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® Check for updates 


As anthropogenic climate change continues the risks to biodiversity will increase over 
time, with future projections indicating that a potentially catastrophic loss of global 
biodiversity is on the horizon’ °. However, our understanding of when and how 


abruptly this climate-driven disruption of biodiversity will occur is limited because 
biodiversity forecasts typically focus on individual snapshots of the future. Here we 
use annual projections (from 1850 to 2100) of temperature and precipitation across 
the ranges of more than 30,000 marine and terrestrial species to estimate the timing 
of their exposure to potentially dangerous climate conditions. We project that future 
disruption of ecological assemblages as a result of climate change will be abrupt, 
because within any given ecological assemblage the exposure of most species to 
climate conditions beyond their realized niche limits occurs almost simultaneously. 
Under a high-emissions scenario (representative concentration pathway (RCP) 8.5), 
such abrupt exposure events begin before 2030 in tropical oceans and spread to 
tropical forests and higher latitudes by 2050. If global warming is kept below 2 °C, less 
than 2% of assemblages globally are projected to undergo abrupt exposure events of 
more than 20% of their constituent species; however, the risk accelerates with the 
magnitude of warming, threatening 15% of assemblages at 4 °C, with similar levels of 
risk in protected and unprotected areas. These results highlight the impending risk of 
sudden and severe biodiversity losses from climate change and provide a framework 
for predicting both when and where these events may occur. 


Climate change is projected to become a leading driver of biodiversity 
loss’, but it is not clear when during this century ecological assemblages 
might suffer such losses, and whether the process will be gradual or 
abrupt. Existing biodiversity forecasts typically lack the temporal 
perspective needed to answer these questions because they indicate 
the number and locations of species threatened by climate change for 
just a snapshot of the future, often around the end of the century’ ?. 
These snapshots do not account for the temporally dynamic nature 
of ecological disruption expected as a result of climate change, often 
focus at the level of species rather than ecological assemblages, and 
can seem remote to decision-makers who are concerned with man- 
aging more immediate risks*. Indeed, many of the most sudden and 
severe ecological effects of climate change can occur when conditions 
become unsuitable for several co-occurring species simultaneously, 
causing catastrophic die-offs and abrupt ‘regime shifts’ in ecological 
assemblages>®. 

Forecasting the temporal dynamics of climate-driven disruption 
of ecological assemblages thus requires quantifying the differences 
among species in the time at which their climate niche limits may be 
locally exceeded. Developing advance warnings of the risk of gradual 
or abrupt ecological disruption is an urgent priority’ °. A temporal 
perspective is also important for adaptation. Reducing emissions and 
delaying the onset of exposure to dangerous climate conditions—even 


by afew decades—could buy valuable time for ecological assemblages 
toadapt””, potentially reducing the magnitude of ecological disrup- 
tion. However, despite the clear importance of atemporal perspective 
in understanding and managing the threats of climate change to biodi- 
versity, we lack a general understanding of the time at which species in 
ecological assemblages will be exposed to climate conditions beyond 
their niche limits. 


The biodiversity climate horizon 


To describe the projected timing of the exposure of species to climate 
conditions beyond their niche, we developed an approach based on 
species historical climate limits and future climate projections. The 
range of climate conditions, over both space and time, under which 
a species has been recorded in the wild demarcates the boundaries 
of its realized niche”. The projected time in the future at which these 
bounds are exceeded owing to climate change at a site can therefore be 
thought of as representing a climate horizon, beyond which evidence 
for the ability of the species to persist in the wild is lacking. Over this 
horizon lies, at best, a sizeable increase in uncertainty about species 
survival and, at worst, local extinction”. For a given species assemblage, 
the cumulative percentage of species over time that have been locally 
exposed to climate conditions beyond their realized niche limits forms 
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Fig. 1| Biodiversity climate horizon profiles. a—e, Horizon profiles (solid 
black lines) indicate the cumulative percentage of species inan assemblage 
exposed to future temperatures (red lines) beyond their realized thermal niche 
over time. Iconic ecosystems provide examples of different profile shapes: 

a, Cayman Islands; b, Coral Triangle; c, Gobi Desert; d, Amazon Basin; e, Congo 
Basin. f, Map of temperature anomalies that shows the locations of the 


what we term the ‘horizon profile’ (Fig. 1). The shape of this horizon 
profile provides information on the potential for climate-driven dis- 
ruption of species assemblages over time—especially the risk of early 
or abrupt disruption—that is not evident when focusing on individual 
climate snapshots. 

We constructed horizon profiles for species assemblages globally, 
delimiting assemblages as the species occurring in 100-km grid cells 
based on expert-verified geographic range maps. A total of 30,652 
species of birds, mammals, reptiles, amphibians, marine fish, ben- 
thic marine invertebrates, krill, cephalopods, and habitat-forming 
corals and seagrasses were included (Supplementary Table 1). We 
used climate projections throughout the twenty-first century from 
22 climate models and 3 RCPs: strong mitigation (RCP 2.6), moderate 
mitigation (RCP 4.5) and a high-emissions scenario (RCP 8.5) (Sup- 
plementary Table 2). Given the importance of temperature as a driver 
of species metabolism and geographic ranges’*®, we focus on mean 
annual temperature as the main proxy for climate. However, because 
species may be sensitive to other climate variables that may respond 
differently to greenhouse gas emissions, we also generated horizon 
profiles using maximum monthly temperatures and terrestrial annual 
precipitation (see Methods). 

For each species at a site (that is, ina 100-km grid cell), we defined 
the local species exposure time as the year after which projected local 
temperatures consistently exceed—for at least 5 years—the maximum 
temperature experienced by the species across its geographic range 
during historical climate projections (1850-2005) (Supplementary 
Fig. 1). For species that breed annually or near-annually, 5 years rep- 
resents a considerable number of breeding seasons at temperatures 
beyond which these species have never been recorded (a 20-year 
window yielded very similar results; Supplementary Figs. 2, 3). This 
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ecosystems in a-e. Horizon profiles and temperature trends are shown fora 
single run of the Hadley Centre Global Environmental Model (HadGEM2) under 
ahigh greenhouse-gas-emissions scenario (RCP 8.5). The profiles differ in 
terms of timing, magnitude and abruptness. The grey lines show historical 
temperature projections ata site. 


approach for quantifying exposure bears similarities to the concept of 
‘time of emergence’ in climate science, defined as the time at whichthe 
signal of anthropogenic climate change at a location emerges fromthe 
envelope of historical climate variability’”°. The key distinction is that 
we define exposure relative to the realized climatic niche limits of each 
species, rather than the historical conditions realized at a single site. 

The shape of horizon profiles, and the potential ecological disruption 
that they imply, can vary substantially across assemblages (Fig. 1). To 
summarize each horizon profile, we focus on three key features: tim- 
ing, the median year for an assemblage in which species exposure to 
unprecedented climate occurs; magnitude, the percentage of species 
locally exposed; and abruptness, the synchronicity in the timing of 
exposure among species in an assemblage, which is measured as the 
percentage of all species exposure times that occur in the decade of 
maximum exposure (Fig. 1a). 


Timing, magnitude and abruptness of horizon profiles 

Under RCP 8.5, 81% of terrestrial and 37% of marine assemblages are 
projected to have at least one species exposed to unprecedented mean 
annual temperatures (that is, beyond historical niche limits) before 
2100. Despite the lower magnitude of warming, the magnitude of 
exposure is greatest in the tropics, where narrow historical climate 
variability”° and shallow thermal gradients” mean that many species 
occur close to their upper realized thermal limits throughout their 
geographic range. In total, 68% of terrestrial and 39% of tropical marine 
assemblages are projected to have more than 20% of their constituent 
species exposed to unprecedented temperatures by 2100, compared 
with 7% of terrestrial and 1% of marine assemblages outside the tropics 
(Fig. 2a). The Amazon, Indian subcontinent and Indo-Pacific regions 
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Fig. 2 | Global variation in the magnitude, abruptness and timing of horizon 
profiles. a, The magnitude of exposure is shown by the percentage of species 
in100-km resolution grid cells (that is, assemblages) that are exposed to 
unprecedented temperature (that is, beyond the realized niche of each species) 
by 2100. b, Abruptness quantified as the percentage of species exposure times 
that occur within the decade of maximum exposure for each assemblage. c, 
Timing quantified as the median year of local species exposure conditional on 
being exposed by 2100, the end of the simulation. Maps show the median value 
across 22 climate models under RCP 8.5 (see Extended Data Fig. 1 for RCP 2.6 
and RCP 4.5). 


are most at risk, with more than 90% of species in any assemblage 
exposed to unprecedented temperatures by 2100 (Fig. 2a). Horizon 
profiles for mean annual temperature and maximum monthly tem- 
perature show strong correspondence (Extended Data Figs. 1, 2). By 
contrast, few species undergo prolonged exposure to unprecedented 
high or low annual precipitation before 2100 (Extended Data Figs. 1, 2), 
whichis in agreement with the greater variability seen in projections 
of precipitation”. Thus, throughout we focus on exposure to changes 
in temperature. 
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The most notable feature of horizon profiles for local assemblages is 
their abruptness (Figs. 1, 2b). Under RCP 8.5, on average 71% (median) of 
local species exposure times for any given assemblage are projected to 
occur withina single decade (Fig. 3a, b), with the abruptness of exposure 
higher among marine assemblages (median abruptness 89%, Fig. 3a) 
than on land (median abruptness 61%, Fig. 3b). This pattern of highly 
synchronized species exposure within assemblages is robust to the 
choice of climate model (for RCP 8.5, median abruptness ranges from 
60% to 79%; Extended Data Figs. 3, 4), emissions scenario (median 
abruptness 83% for RCP 2.6 and 72% for RCP 4.5), metric of abruptness 
(Extended Data Fig. 4), and when calculating exposure for maximum 
monthly temperature (median abruptness 68%) rather than mean 
annual temperature (Extended Data Figs. 1, 2). The same pattern of 
abruptness is also evident for horizon profiles constructed separately 
for each taxonomic group within local assemblages (Extended Data 
Fig. 4). Marine organisms—especially seagrasses, corals, cephalopods, 
marine reptiles and marine mammals—exhibit the most abrupt profiles, 
but it is the consistency of abruptness across groups, rather than the 
differences, that is most notable. Similarly, although the abruptness of 
exposure varies spatially—being greatest in the Amazon, Indian subcon- 
tinent, Sahel and Northern Australia, as well as tropical oceans—abrupt 
horizon profiles are the general rule both within the tropics (median 
abruptness 79%) and at higher latitudes (median abruptness 59%) 
(Fig. 2b). 

This pervasive pattern of abrupt exposure arises primarily because 
co-occurring species often share similar realized thermal limits, 
rather than abruptness being dependent on higher rates of warming 
(Extended Data Fig. 5). This clustering of species-realized thermal limits 
can, in part, be explained by shared geographic barriers or, for tropical 
species, by the upper limits of temperatures available on Earth’. How- 
ever, even where these factors cannot explain the clustering of thermal 
limits because a high percentage of species have warmer temperatures 
available within 1,000 km of their range edge, assemblage exposure is 
still projected to occur abruptly (Extended Data Fig. 5); this suggests 
that other processes, suchas ecological interactions” or evolutionary 
conservatism in fundamental niches”, lead to similarity in realized 
niche limits'®”’ and thus abruptness in the timing of exposure. 

The synchronicity of species exposure within assemblages means 
that the timing of assemblage-level exposure events is well described 
by the median of species exposure times at a site (Extended Data Fig. 6). 
Under RCP 8.5, the global mean year of assemblage-level exposure is 
2074 (+11 years (s.d.)), but there is considerable variation in the timing 
of exposure across assemblages (Fig. 2c). In some locations—such as 
the Caribbean and the Coral Triangle—exposure is predicted to be 
underway already, with these hotspots of exposure expanding in spa- 
tial extent over time (Fig. 2c, Extended Data Fig. 7). By 2050, exposure 
spreads beyond ocean ecosystems to iconic terrestrial ecosystems, 
such as the Amazon, Indonesian and Congolese rainforests (Fig. 2c, 
Extended Data Fig. 7). Notably, the timing of these assemblage-level 
exposure events is not well predicted by the timing of local climate 
emergence (Spearman’s p 0.29; Extended Data Fig. 5); in addition, the 
timing of abrupt exposure events lags behind local climate emergence 
by 42 years (+12 years; mean + s.d.), indicating the potential time-lag 
between climate change and ensuing biotic responses. 

The abrupt exposure of species within ecological assemblages has 
not been detected in earlier projections of climate-driven range loss 
and global species extinctions, which have implied a more gradual 
increase in risk to biodiversity”*. We find that the appearance of a grad- 
ual increase in risk can result from summarizing across local assem- 
blages that differ in their projected timing of abrupt exposure (Fig. 3c, 
d, Extended Data Fig. 8). Although these global summaries mask the 
abrupt nature of exposure within local assemblages, they can high- 
light the importance of increased mitigation efforts in reducing and 
delaying the onset of unprecedented climate conditions. Compared to 
RCP 8.5, achieving RCP 2.6 delays exposure for the most at-risk species 
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Fig. 3 | Abruptness of horizon profiles locally compared with globally, and 
the accelerating risk with global warming. a, b, The distribution inthe 
projected abruptness of species exposure to unprecedented temperatures 
within marine (a) and terrestrial (b) assemblages. Selected assemblages from 
Fig. lare highlighted. Abruptness is quantified as the percentage of species 
exposure times that occur within the decade of maximum exposure, with 
results showing the median across climate models under RCP 8.5.c,d, Global 
horizon profiles for oceans (c) and land (d) show more gradual accumulation of 
species exposure to unprecedented temperatures. Dashed lines show how 
lowering emissions from RCP 8.5 to RCP 2.6 both reduces the median 


by approximately 6 decades in the oceans (mean 58 years, range 46-65 
years; Fig. 3c) and on land (mean 58 years, range 49-67 years; Fig. 3d), 
buying valuable time for species and ecosystems—and human societies 
that depend on them—to adapt to a warming climate. 


The risk of abrupt exposure events 


The abruptness of horizon profiles is positively correlated with the 
magnitude of exposure (Spearman’s p 0.58; Extended Data Fig. 6), which 
indicates that exposure events involving larger fractions of species are 
projected to occur more abruptly. This near-simultaneous exposure 
among multiple species could have sudden and devastating effects on 
local biodiversity and ecosystem services. Catastrophic, multi-species 
coral die-offs caused by a record-breaking marine heatwave in 2016 
are one recent example®. Although the ‘safe limits’ of species loss— 
at which ecosystem function can be maintained— remain uncertain, 
meta-analyses suggest a 20% decline in species diversity as one pos- 
sible threshold?*”’. We therefore defined assemblages at risk of abrupt 
ecological disruption as those in which at least 20% of species are pro- 
jected to undergo exposure to unprecedented temperatures within 
the same decade. Restricting global warming to less than 2 °C above 
pre-industrial levels limits such abrupt assemblage exposure events to 
less than 2% of assemblages (Fig. 3e). However, beyond 2 °C warming, 
the area projected to undergo abrupt assemblage exposure expands 
rapidly, encompassing 15% of assemblages globally at 4 °C warming. 
Furthermore, the increase in abrupt exposure does not differ markedly 
for assemblages that are afforded high habitat protection (at least 20% 
protected area coverage of a grid cell), indicating that current protected 
areas are equally at risk from abrupt exposure (Fig. 3e). 

The risk of abrupt exposure events differs across assemblages glob- 
ally, with variability across individual climate projections increasing 
the total area at risk compared with median projections. For instance, 


Ocean === = 


magnitude of exposure across climate models and substantially delays the 
timing of exposure, buying about 60 years for species and conservation plans 
to adapt toa warming climate (see Extended Data Fig. 8 for individual climate 
models). e, The percentage of species assemblages projected to experience 
high-magnitude and abrupt assemblage exposure (more than 20% of species 
exposed ina single decade) asa function of global mean temperature increase 
relative to pre-industrial levels (1850-1900). Curves are fitted from model runs 
(n= 66) across RCP 2.6, RCP 4.5 and RCP 8.5. ‘Current policy pledges’ referstoa 
scenario in which countries implement their current nationally determined 
contributions to 2030 and make no further emissions reductions. 


even under RCP 2.6 (1.75 °C mean warming), 9% of assemblages are at 
some risk of abrupt exposure (Fig. 4a), and this increases to 35% of 
assemblages under RCP 8.5 (4.4 °C mean warming; Fig. 4b). The risk 
of abrupt assemblage exposure events is positively correlated with 
species richness (RCP 8.5, Spearman’s p 0.29 (land) and 0.56 (ocean)), 
highlighting the increased risk of sudden ecological disruption in the 
world’s most biodiverse ecosystems. Moreover, the risk of disruption 
of ecological function may be underestimated in this analysis because 
even if particular functional groups (for example, habitat-forming cor- 
als) suffer high levels of exposure, this may not be evident at the scale 
of entire assemblages if other groups are relatively less affected. When 
abrupt assemblage exposure events are instead defined at the level of 
major taxonomic groups, the area at risk expands further, encompass- 
ing 49% of species assemblages under RCP 8.5 (Fig. 4c, Extended Data 
Fig. 9). Our approach estimates how much of the original biodiversity of 
an assemblage is exposed to potentially dangerous climate conditions 
over time”’, We do not consider the potential for immigration of spe- 
cies from elsewhere to offset local biodiversity losses; however, abrupt 
assemblage-wide exposure is likely to cause substantial ecological 
disruption regardless of the rate at which new species arrive. Further- 
more, in tropical lowlands and oceans—where projected exposure is 
greatest and species adapted to warmer environments are lacking—net 
declines in local biodiversity are expected”. 


Crossing the biodiversity climate horizon 

Although the horizon profile describes the accumulating number of 
co-occurring species that are exposed to conditions beyond their real- 
ized niche limits, this need not equate witha profile of local extinction. 
Species may have wider fundamental niche limits than realized niche 
limits’**°, may avoid exposure in microclimatic refugia (however, see 
Extended Data Fig. 10) or through behavioural thermoregulation”), 
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Fig. 4| The risk of high-magnitude, abrupt assemblage exposure events. 
a-c, Riskis shown for all species under RCP 2.6 (a), all species under RCP 8.5 (b) 
and single taxonomic groups under RCP 8.5 (c). Riskis calculated as the 
proportion of 22 climate models in which an abrupt exposure event is 
projected to occur before 2100. Assemblages that avoid abrupt exposure 
events across all 22 models are in grey. Ina, b, abrupt exposure events are 
defined as when more than 20% of all species in an assemblage are exposedina 
single decade. Inc, abrupt exposure events are defined when any single group 
of organisms (for example, amphibians or corals) within an assemblage 
experiences the exposure of more than 20% of its constituent species ina single 
decade, highlighting the widespread risk of abrupt ecological disruption. 


or may evolve to tolerate new conditions”. In these cases, the timing of 
abrupt assemblage exposure events could be considered an ‘ignorance 
horizon’—the time beyond which local extinctions are not inevitable 
but evidence for the ability of species to persist in the wild is largely 
absent”. Thus, at the very least, our results show that within 30 years, 
continued high emissions will drive a sudden shift across many ecologi- 
cal assemblages to climate conditions under which we have almost no 
knowledge of the ability of their constituent species to survive. We 
caution that the timing and magnitude of this exposure may occur 
earlier and be larger than we anticipate, because our analysis does 
not consider changes in extreme events’, effects of warming on local 
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habitat (for example, melting sea ice), covariation between climate 
variables™, or that populations may be locally adapted”. 

Furthermore, to the extent that species-realized historical thermal 
limits do reflect fundamental limits to persistence, then the occur- 
rence of abrupt exposure events marks the crossing of an ‘ecological 
horizon’ beyond which catastrophic and coordinated species losses are 
expected. These abrupt events—projected to spread from ocean (for 
example, coral reef) to land (for example, rainforest) ecosystems by 
2050 under high emissions—risk sudden disruption to ecosystems and 
their capacity to maintain current levels of biodiversity and function- 
ing. Evidence from laboratory-based and field-based studies indicates 
this is a credible risk, particularly for tropical terrestrial ectotherms 
and for marine organisms for which projected abruptness is most pro- 
nounced and for which realized geographic range boundaries most 
closely match thermal tolerance limits’*"**°™. Indeed, warming over 
recent decades has already been associated with marked population 
declines and local extinctions®**°—even among endotherms, which are 
widely assumed to be less sensitive to warming but may be particularly 
vulnerable to climate-driven disruption of trophic interactions*”*’. 
For those ecosystems for which exposure is projected within the next 
few decades, the capacity for species to adapt would appear limited. 
A priority for future research is to refine estimates of the timing and 
consequences of exposure, including for regions in which factors other 
than temperature may more strongly constrain species ranges, and for 
which the emergence of novel climates has closest analogues deep in 
Earth’s history”. 

Considering the temporal dynamics of the exposure of biodiversity 
toclimate change provides an early warning of the potential for abrupt 
ecological disruption. Averting—or at least delaying—the crossing of 
this ecological horizonis possible for most assemblages, and requires 
massive and rapid reductions in greenhouse gas emissions. Our results 
also highlight the urgency of targeted management responses, includ- 
ing establishing monitoring sites in exposed regions, establishing new 
protected areas in refugia, and investigating the potential of assisted 
migration and adaptation. 
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Methods 


Biodiversity data 

We used expert-verified range maps for 30,652 species from the Inter- 
national Union for Conservation of Nature (IUCN)“ and BirdLife Inter- 
national*®, including birds, mammals, reptiles, amphibians, marine 
fish, benthic marine invertebrates, and habitat-forming corals and 
seagrasses (Supplementary Table 1). To further increase coverage of 
open-ocean assemblages, our sample includes additional data on krill 
and cephalopods”, reflecting the availability of expert range maps for 
oceanic species*. We used only breeding ranges for terrestrial species. 
We excluded marine species that are restricted to depths greater than 
200 m (the lower limit of the epipelagic zone), as these species are less 
likely to respond to changes in sea surface temperature. Range maps 
were converted to 100-km-resolution equal-area grid cells—the finest 
resolution justifiable for these data globally without incurring false 
presences**“5, Expert range maps provide comprehensive information 
onthe global geographic distributions of species*®, but our results 
should be interpreted in the context of known data limitations. For 
some groups, species coverage is incomplete and is biased towards 
commercial species (for example, cephalopods), whereas others have 
been comprehensively assessed for only a subset of clades (for example, 
fish) and the species included in our study thus represent anon-random 
subset of global biodiversity (Supplementary Table 1). For instance, 
insects and plants may on average be more at risk of geographic range 
loss due to climate change than are terrestrial vertebrates’, but we did 
not assess exposure for these groups because range maps (expert or 
otherwise) are not available globally. As such, both very short-lived 
and long-lived terrestrial taxa may be underrepresented in our sample. 
Furthermore, although many IUCN range maps consider occurrence 
data from historical records, others may underestimate climate niche 
limits where longer-term historical records are unavailable and recent 
geographic range contractions have occurred in part due to reasons 
other than climate change”. 

Data on marine and terrestrial protected areas were downloaded 
from the World Database on Protected Areas (http://protectedplanet. 
net/; accessed 21 March 2018). The maps, originally in polygon format, 
were resampled to a 1-km resolution before further analysis. We con- 
sidered 100-km-resolution grid cells highly protected if at least 20% 
of the grid cell was inside protected areas. 


Climate model data 
We used temperature and precipitation projections from 22 General 
Circulation and Earth System Models developed for the Coupled Model 
Intercomparison Project 5 (CMIP5) (Supplementary Table 2). For each 
model, we downloaded a single projection for mean monthly precipi- 
tation (mm), near-surface temperature (K) and sea surface tempera- 
ture (K) for the historical run (1850-2005), as well as RCP 2.6, RCP 4.5 
and RCP 8.5 scenarios for the years 2006-2100 or 2006-2300, when 
available. Model output was downloaded from https://esgf-node.IInl. 
gov/projects/esgf-IInl/ (accessed 5 June 2017). In our main analysis, 
we focus on the dynamics of exposure according to mean annual tem- 
perature (MAT), calculated by averaging monthly values. However, we 
also repeated our analysis using the temperature of the hottest month, 
hereafter denoted maximum monthly temperature (MMT), and, for 
terrestrial assemblages, total annual precipitation (mm), calculated 
by summing precipitation values across months (see Supplementary 
Information). Note that the identity of the hottest month can vary both 
across sites and between years within a site. Given that CMIP5 models 
use different spatial grids, and to match the resolution of species geo- 
graphic range data, climate model data were regridded to al100-km 
resolution grid using an area-weighted mean interpolation. Climate 
data interpolation was performed in CDO* and R®. 

We calculated species exposure times for each assemblage using indi- 
vidual climate simulations, as opposed to ensembles or multi-model 


averages, because individual simulation runs include variance in cli- 
matic time series due to internal climate variability such as the timing 
of EI Nifo/Southern Oscillation events”. This internal variability is 
a key component of the uncertainty in the timing of exposure, and is 
smoothed out if using multi-model averages as input into the analy- 
sis. By calculating species exposure events using individual model 
simulation runs and then summarizing across models, we capture 
the uncertainty in the timing of exposure due to both internal climate 
variability and climate model uncertainty (that is, uncertainty about 
climate physics across models), inline with ‘time of emergence’ analyses 
from climate science”. Throughout, we report multi-model medians 
in each of our summary metrics. 


Defining species-realized niche limits 

Species experience variability in climatic conditions across both space 
and time, but this temporal variability is ignored when using 
time-averaged climate conditions (for example, Worldclim data*) to 
estimate species-realized niches. To address this, we estimated 
species-realized niche limits using the climate projections from the 
historical run of each climate model (1850-2005), which includes the 
influence on climate of observed changes in radiative forcing due to 
natural factors such as volcanic eruptions, as well as anthropogenic 
emissions and land-use changes”. Thus, in the case of MAT, we calcu- 
lated the maximum MAT experienced across the species geographic 
range over both space and time (Trax yaar? see Supplementary Informa- 
tion). To prevent estimates Of Tnax,,,, being inflated by either extreme 
outliers in the temperature time series or from the overestimation of 
species ranges“, we excluded outlier temperature values within each 
grid cell, defined as those more than three standard deviations from 
the mean. After we had selected the maximum temperature for each 
cell, we excluded outlier temperature values across each species range, 
defined as those more than three standard deviations above the mean 
range value. The 7,,ax,,,, Value for each species was then set as the max- 
imum of the remaining values (Supplementary Fig. 1). We used an iden- 
tical procedure to calculate 7,,,,,, using MMT (Trax sane" For precipitation, 
species may be exposed to either drying or wetting conditions and so 
we calculated both the minimum (P,,,,) and maximum (P,,,,) precipita- 
tion values experienced by each species across its geographic range 
(see Supplementary Information). 


Estimating species exposure times 

Within each terrestrial (n =18,560) and marine (n =37,333) assemblage 
(that is, 100-km grid cell containing any terrestrial or marine species, 
respectively), we defined the time of local species exposure to unprec- 
edented temperature (that is, the climate horizon) to be the year after 
which the MAT (or MMT) of the cell is projected to exceed the 7,,..,,_, 
(OF Tinaxyy,,) Value of the species for at least five consecutive years. We 
note that using a higher number of consecutive years (n = 20 years) 
had little effect on the magnitude, timing or abruptness of exposure 
(Supplementary Figs. 2, 3). 

For precipitation, we calculated the time of local species exposure 
as the year after which the precipitation of the cell is projected to be 
either greater or less than the P,,,, and P,,, values, respectively, of the 
species for at least five consecutive years. Annual precipitation values 
are bounded at zero, and this could potentially lead to exposure being 
underestimated for locations projected to have historically received 
zero precipitation. To address this, we additionally defined exposure 
to occur when annual precipitation decreased to less than 15 mm for 
at least five consecutive years. Owing to the generally weaker trends 
and high variability in historical and future projected precipitation, 
we found that few species were exposed to unprecedented precipita- 
tion regardless of how exposure was defined (Extended Data Fig. 1). To 
showtheimportance of increasing temperatures as the primary driver 
of exposure, we compared patterns of exposure from MAT alone to 
those from MAT and precipitation combined, recording the earliest 


local exposure time of either MAT or precipitation for a species in an 
assemblage when it was exposed to both variables (Extended Data 
Fig. 2, see Supplementary Information). 

We note that by using range-wide estimates of species niche limits, 
we may underestimate both the magnitude and the immediacy of expo- 
sure if populations are locally adapted®. Unfortunately, information 
on the scale and strength of local adaptation is not generally avail- 
able across species. Equally, our analysis does not attempt to model 
adaptive evolution, which may enable species to shift or expand their 
climatic niche limits over time. Nevertheless, our estimates of the tim- 
ing of local exposure to unprecedented conditions may be relevant for 
understanding the potential for evolution to rescue populations from 
changing climates’. 


Horizon profiles 

When species exposure times had been calculated for an assemblage, 
we constructed a horizon profile indicating the cumulative percentage 
of species that are locally exposed to conditions beyond their realized 
niche limits. We used the following metrics to summarize the temporal 
dynamics of biodiversity exposure. First, we calculated the magnitude 
of exposure as the percentage of species in the assemblage exposed 
over the course of the twenty-first century. Second, the abruptness 
of exposure for an assemblage was calculated as the percentage of all 
exposure times that occur in the decade of maximum exposure. We 
identified the decade of maximum exposure using a moving window 
of ten years. We also calculated an alternative metric of abruptness 
based on the Shannon entropy index®, which quantifies the evenness 
inthe distribution of exposure times across all decades of the horizon 
profile (Extended Data Fig. 4). In contrast to our original abruptness 
metric, lower values of the Shannon entropy index indicate a more 
abrupt profile. We therefore rescaled the Shannon entropy index by 
the maximum possible entropy value per assemblage, subtracted these 
values from 1 and then multiplied by 100 to give an index that ranged 
between 0 and 100, where a value of 100 indicates that all exposure 
times occur ina single decade and a value of 0 corresponds to an equi- 
table distribution of exposure times across decades. Abruptness was 
calculated only for assemblages in which five or more species were 
exposed, to avoid idiosyncrasies due to small sample sizes. Third, the 
timing of exposure for each assemblage was calculated as the median 
of the times of local species exposure events. Species not exposed 
before the end of the twenty-first century were excluded from this cal- 
culation. We repeated our analysis using alternative metrics of timing, 
including the mean year of exposure and the mid-point of the decade 
of maximum exposure, obtaining very similar results (Extended Data 
Fig. 6). For each of these exposure metrics we report the median value 
across the 22 climate models for a given climate scenario, and quantify 
uncertainty as the standard deviation (Extended Data Fig. 3). The great- 
est uncertainty in projected effects involves the magnitude of exposure 
along the boundaries of the tropics. This arises because of variation 
among models in the magnitude of warming, which alters the spatial 
extent of regions exposed to unprecedented temperatures. By contrast, 
variation among models in the timing and abruptness of exposure is 
relatively small and does not exhibit any clear spatial structure. 

We compared the median timing of species exposure within assem- 
blages to the timing of local climate emergence, defined as the year after 
which future local temperatures are projected to exceed the maximum 
historical (1850-2005) conditions at a site!””°. Timing of emergence 
was calculated using an identical procedure to that used to calculate 
the timing of exposure, excluding outlying values from the time series 
when quantifying the maximum historical temperature at a site and 
considering emergence only when temperatures exceed the historical 
maximum for at least five consecutive years. The time of local climate 
emergence ata site is therefore identical to the time of local exposure 
for a species occupying a single grid cell. In the absence of perfect 
adaptation to local climates, atime-lagis therefore expected between 


local climate emergence and the median timing of exposure, because 
species typically persist under a broader range of conditions than is 
present in any single site. 


Spatial scale 

We modelled species-realized niche limits using climate projections at 
100-km grain size, matching the resolution of expert geographic range 
maps“*“’. However, individual grid cells at this resolution may contain 
(potentially substantial) spatial climatic heterogeneity, thus poten- 
tially underestimating variability in species niche limits and potentially 
overestimating the abruptness of assemblage exposure dynamics. To 
investigate this possibility, we tested whether the abruptness of horizon 
profiles across terrestrial assemblages is related to the range in the MAT 
within each grid cell, using spatially interpolated temperature data for 
the period 1970-2000 available at 1-km resolution. We found that 
abruptness is negatively correlated with the spatial heterogeneity in 
temperature within a cell (Spearman’s p -0.29), so that assemblages 
with higher spatial heterogeneity in temperatures (for example, tropical 
mountains), exhibit more gradual exposure profiles than those with 
low heterogeneity in temperatures (for example, tropical lowlands) 
(Extended Data Fig. 10). This result has two important implications. 
First, it suggests that—despite the relatively coarse grain size—our 
analysis still identifies those assemblages in which variation in real- 
ized niche limits among species is expected to be greatest (that is, 
grid cells containing substantial spatial climatic heterogeneity) as 
having the most gradual exposure profiles. Second, it suggests that, 
although incorporating finer-scale climate data may further reduce 
the lowest abruptness values estimated across assemblages (that is, 
making relatively gradual horizon profiles more gradual), it is unlikely 
to alter the key conclusion that assemblage exposure to climate warm- 
ing occurs abruptly, because the most abrupt horizon profiles occur 
in assemblages in which there is little fine-scale climatic heterogeneity 
(Extended Data Fig. 10). These results support the robustness of our 
overall conclusions regarding the dynamics of exposure, but it is clear 
that increasing the spatial resolution at which species niche limits and 
assemblages are defined would enable a more precise quantification 
of the timing of species exposure to changing climates, and should be 
a priority for future research. 

Horizon profiles can be calculated either for a single assemblage 
or for a set of assemblages combined, such as a biome or the entire 
globe. In addition to examining the dynamics within assemblages, 
we generated global horizon profiles, describing the total cumula- 
tive exposure of all populations (that is, species by site combinations) 
across marine and terrestrial assemblages (Fig. 3c, d). To avoid expo- 
sure dynamics being driven by the small number of species with the 
largest geographic ranges, we weighted each species by the inverse of 
its geographic range size. This range-size-weighted exposure profile 
ensures that each species contributes equally to exposure dynamics, 
and is mathematically equivalent to calculating the mean percentage 
geographic range exposure across species. Unweighted global horizon 
profiles show qualitatively similar patterns (Extended Data Fig. 8). 


Risk of abrupt exposure events 

We identified those assemblages projected to undergo abrupt and 
high-magnitude exposure events, defined as at least 20% of resident 
species exposed within a single decade before the end of the twenty-first 
century. Across the set of 66 climate model runs from the 3 RCP sce- 
narios, we fit a generalized additive model to estimate the percentage of 
assemblages projected to undergo abrupt exposure events as a function 
of mean global warming at the end of the century (2080-2100) rela- 
tive to pre-industrial conditions (1850-1900). We fit separate models 
for sites with either low or high (that is, greater than 20% in protected 
areas) levels of habitat protection. We forced the regression through 
the origin, thus assuming no abrupt exposure events would occur if 
temperatures remained stable at pre-industrial conditions. Because 
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the identity of assemblages projected to undergo abrupt exposure 
events may vary across model runs, the actual area at risk of abrupt 
exposure may be substantially greater than expected under any single 
climate simulation. For each assemblage, we therefore calculated the 
probability of an abrupt exposure event across the 22 climate models 
within each emissions scenario. We did this for assemblages consisting 
of all species, as well as for each group of organisms separately. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All datasets used here are publicly available. Expert verified range 
maps are available from https://www.iucnredlist.org/resources/ 
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requestdis. Climate change projections for RCP 8.5, RCP 4.5 and RCP 
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Extended Data Fig. 1| Spatial distribution of the magnitude, abruptness and timing of assemblage exposure for alternative climate variables. a~c, Shown 
is the median value across 22 CMIP5 climate models for MAT (a), MMT (b) and precipitation (c) under RCP 2.6, RCP 4.5 and RCP 8.5. 
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Extended Data Fig. 2 | Comparing the magnitude, timing and abruptness of 
assemblage exposure across alternative climate variables. a—c, Patterns of 
exposure to both MAT and precipitation combined are very similar to patterns 
of exposure to MAT only, highlighting the importance of changes in 
temperature in driving exposure. d-i, Patterns of exposure to unprecedented 
temperatures show both similarities and differences depending on whether 
temperature is quantified using MAT or MMT. More species are exposed and 
exposure occurs earlier for MAT compared with MMT, but spatial variation in 
the magnitude (d, g) and timing (e, h) of exposure are strongly correlated 


between temperature variables. Variation in the abruptness of assemblage 
exposure is less strongly correlated between MAT and MMT (f), but both 
variables confirm the abruptness of projected exposure (i). Values are the 
median across 22 CMIP5 climate models under RCP 8.5, with hotter colours 
indicating a higher density of points. Points falling along the dashed 1:1line 
indicate a perfect correspondence between metrics. The correlation between 
metrics (Spearman’s p), and the mean difference in the timing of exposure 
(years), is shown. 
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Extended Data Fig. 3| Uncertainty in species local exposure metrics across 
22 CMIPS climate models under RCP 8.5. Uncertainty (standard deviation, 
SD) inthe magnitude of exposure is greatest around the boundaries of the 
tropics, with little geographic variation in uncertainty in timing or abruptness. 
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Extended Data Fig. 4| Abruptness of horizon profiles. Density plots (left) 
show the distribution of abruptness values for different CMIP5 climate models 
(n=22, lines) and RCPs on land (red) and in the ocean (blue). Histograms (right) 
show the median abruptness across climate models under RCP 8.5 for each 
group of organisms. Abruptness is calculated as the percentage of exposure 
times occurring within the decadal window of maximum exposure (colours). 


Abruptness is also shown for an alternative metric based on the Shannon 
entropy index (grey) with values scaled between 0 and 100, indicating the most 
gradual andthe most abrupt distribution of exposure times possible for agiven 
assemblage, respectively. Exposure is consistently abrupt across climate 
models, RCP scenarios, metrics and organism groups. 
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Extended Data Fig. 5 | Predicting the timing, magnitude and abruptness of 
local species exposure. a-h, On land (left) and in the ocean (right) the median 
timing of exposure (a, b) is weakly correlated (Spearman’s p) with the timing of 
local climate emergence. The magnitude of exposure (c, d) is weakly correlated 
with the magnitude of warming between the start (2000-2020) and the end 
(2090-2100) of the twenty-first century. The abruptness of exposure 
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(percentage of local species exposure times that occur in the decade of 
maximum exposure) is only partly correlated with the maximum rate of 
warming (maximum difference in mean temperature between successive 
decades) (e, f) or the percentage of species with nowhere warmer within 1,000 
km of their range (g, h). Values are the median across 22 CMIPS climate models 
under RCP8.5. Hotter colours indicate a higher density of points. 
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Extended Data Fig. 6| The different dimensions of climate risk to species 
assemblages. a-c, Bivariate plots showing the strong correlation among 
alternative metrics for the timing of local assemblage exposure: the median 
year of local species exposure, the mean year of local species exposure and the 
mid-point of the decadal window of worst (that is, maximum) local species 
exposure. d-f, Bivariate plots showing the weaker correlation between the 


magnitude, abruptness and timing of exposure across assemblages. Values are 
the median across 22 CMIP5 climate models under RCP 8.5, with hotter colours 
indicating a higher density of points. Ina-c, points falling along the dashed 1:1 
line indicate a perfect correspondence between metrics. The correlation 
between metrics (Spearman’s p) is shown, as well as (for a—c) the mean 
difference in the timing of exposure (years). 
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Extended Data Fig. 7 | Accumulation of exposure to unprecedented climate models under RCP 8.5, highlighting the immediate onset of exposure in 
temperatures at decadal time snapshots from 2030 to 2100. Light grey the tropics that spreads to higher latitudes later inthe century. 
indicates zero local species exposure. Maps show the median across 22 CMIP5 
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Extended Data Fig. 8 | The global biodiversity horizon profile. a-d, The 
cumulative exposure to unprecedented temperatures of all local species 
populations (that is, species X site aggregated across all sites) increases 
smoothly over time at the global scale. Global horizon profiles are shown when 
species are weighted by the inverse of their geographic range size (equivalent 


RCP4.5 


oO 
a 
° 
ro) 


Qn oo 
oooe°7o 


BN 
o 


30 


mean geographic 
range exposure (%) 


100 


% local species 
populations exposed =» 


to the mean percentage of the geographic range exposed) (a, b) or are given 
equivalent weighting (d-f). In d-f, dynamics are dominated by species with 
many local populations (that is, large geographic ranges). Variability in 
exposure across 22 climate models (thin lines) is shown for each RCP scenario 
(median, thick line). 


RCP2.6 : all species 


. ae a 
: =. fer 


RCP8.5 : all species ; 


so SR a 


0 0.045 0.1 0.2 0.3 0.4 


Extended Data Fig. 9 | The global distribution in the risk of high-magnitude 
and abrupt assemblage exposure events under different representative 
concentration pathways. Maps show the probability of abrupt exposure 
calculated across 22 CMIPS5 climate models. The risk of abrupt exposure was 
calculated on the basis of all species in an assemblage (left column) and for each 
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group of organisms separately (right column). The maps highlight the greater 
risk of abrupt exposure events under intermediate (RCP 4.5) and especially 
under high (RCP 8.5) emission pathways, and when considering taxonomic 
groups separately. 
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Extended Data Fig. 10| Abruptness of horizon profiles for terrestrial 
vertebrates in 100-km grid cells with low or high spatial temperature 
heterogeneity. Red, low heterogeneity; grey, high heterogeneity. Abruptness 
is calculated as the percentage of species exposure times in the decade of 
maximum exposure. Temperature heterogeneity is the range in temperatures 
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at 1-km resolution within each 100-km cell. Assemblages with abrupt exposure 
have lower temperature heterogeneity, which suggests that quantifying 
species niches at finer spatial resolutions is unlikely to alter the abrupt nature 
of assemblage exposure dynamics. 
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All datasets used are publicly available. We used expert verified range maps for 30,652 species from the International Union for Conservation of Nature (https:// 
www. iucnredlist.org/resources/spatial-data-download) and BirdLife International (http://datazone.birdlife.org/species/requestdis) including; birds, mammals, 
reptiles, amphibians, marine fish, benthic marine invertebrates, and habitat forming corals and seagrasses. Climate change projections for RCPs 8.5, 4.5, and 2.6 for 
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Study description 


Research sample 


Sampling strategy 


Data collection 


Timing and spatial scale 


Future climate projections from earth system models are combined with information on species geographic distributions (n=30,652 


species) to estimate species realised thermal niche limits and project the timing of future exposure to conditions beyond their niche. 


We used expert verified range maps for 30,652 species from the International Union for Conservation of Nature (https:// 

www. iucnredlist.org/resources/spatial-data-download) and BirdLife International (http://datazone.birdlife.org/species/requestdis), 
including; birds, mammals, reptiles, amphibians, marine fish, benthic marine invertebrates, and habitat forming corals and 
seagrasses. This sample reflects availability of geographic range data for each organisms group globally. 


The sample size reflects availability of geographic range data for each organisms group. These sample sizes make up the majority of 
species in each major taxonomic group included in the study. 


All data used here is already published and was downloaded from public data portals 


The data on species distributions we use represents more than a century of collecting efforts and observations by scientists, 


naturalists and the public. The climate data we use is generated by simulations from computer models of the earth system. Both 
kinds of data are accurate to ~100km resolution and are available globally. 


Data exclusions We excluded marine species restricted to depths >200m to focus on the effects of sea surface temperatures on shallow water 
species. To prevent estimates of maximum temperature being inflated by either extreme outliers in the temperature time series or 
from the overestimation of species ranges we excluded outlier temperature values within each grid cell, defined as those more than 
three standard deviations from the mean. Once we had selected the maximum temperature for each cell, we excluded outlier 
temperature values across each species range, defined as those more than three standard deviations above the mean range value. 
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The results can be reproduced using the publicly available data and the analysis code is available on request from the authors. Code 
and results data to reproduce Figures 2-4 is available at figshare (10.6084/m9.figshare.11814633). 


Randomization No randomization was required. Our study was not experimental, but based on observed biodiversity and climate patterns. 
Blinding Our study was not experimental and so blinding is not relevent 
Did the study involve field work? Yes No 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


> 
jad) 
2 
= 
= 
a 
= 
o 
Nn 
© 
red) 
= 
(a) 
= 
= 
o 
18. 
fo) 
oa 
D 
a 
Wn 
(S 
3 
3 
jad) 
5 
< 


Article 


Extant timetrees are consistent witha 
myriad of diversification histories 


https://doi.org/10.1038/s41586-020-2176-1 


Received: 14 September 2019 


Stilianos Louca'’?™ & Matthew W. Pennell®*™ 


Accepted: 10 March 2020 


Published online: 15 April 2020 


® Check for updates 


Time-calibrated phylogenies of extant species (referred to here as ‘extant timetrees’) 
are widely used for estimating diversification dynamics'. However, there has been 
considerable debate surrounding the reliability of these inferences” * and, to date, this 


critical question remains unresolved. Here we clarify the precise information that can 
be extracted from extant timetrees under the generalized birth-death model, which 
underlies most existing methods of estimation. We prove that, for any diversification 
scenario, there exists an infinite number of alternative diversification scenarios that 
are equally likely to have generated any given extant timetree. These ‘congruent’ 
scenarios cannot possibly be distinguished using extant timetrees alone, even inthe 
presence of infinite data. Importantly, congruent diversification scenarios can exhibit 
markedly different and yet similarly plausible dynamics, which suggests that many 
previous studies may have over-interpreted phylogenetic evidence. We introduce 
identifiable and easily interpretable variables that contain all available information 
about past diversification dynamics, and demonstrate that these can be estimated 
from extant timetrees. We suggest that measuring and modelling these identifiable 
variables offers amore robust way to study historical diversification dynamics. Our 
findings also make it clear that palaeontological data will continue to be crucial for 
answering some macroevolutionary questions. 


Acentral challenge in evolutionary biology is to reconstruct rates of 
speciation and extinction over time’. Unfortunately, the majority of 
taxa that have ever lived have not left much trace in the fossil record, 
and the primary source of information on their past diversification 
dynamics therefore comes from extant timetrees. Many methods 
have been developed for extracting this information; most methods 
fit variants of a birth-death process’*. Despite the popularity of these 
methods, which collectively have been used in thousands of studies’ °, 
their reliability has been called into question by comparisons with 
fossil-based estimates!***"", The reasoning behind these critiques 
is that there may be insufficient information in extant timetrees to 
fully reconstruct historical diversification dynamics. However, this 
critical issue has remained unresolved; it is unknown precisely what 
information on speciation and extinction rates is contained in extant 
timetrees. 

Here we present a definite answer to this question for the general 
stochastic birth-death process with homogeneous (that is, lineage- 
independent) rates, in which speciation (‘birth’) rates (A) and extinc- 
tion (‘death’) rates (1) can vary over time, that underlies the majority 
of existing methods for reconstructing diversification dynamics from 
phylogenies. We mathematically show that, for any given candidate 
birth-death model, there exists an infinite number of alternative birth- 
death models that can explain any extant timetree equally as well as 
can the candidate model. These alternative models may appear to be 
similarly plausible and yet exhibit markedly different features, such as 


different trends through time in both A and yp. This severe ambiguity 
persists for arbitrarily large trees and cannot be resolved even withan 
infinite amount of data; it is thus impossible to design asymptotically 
consistent estimators for A and yw. Using simulated and real timetrees 
as examples, we demonstrate how failing to recognize this issue can 
seriously mislead our inferences about past diversification dynamics. 
We present appropriately modified variables that are asymptotically 
identifiable and that contain all available information on historical 
diversification dynamics. 


Lineages through time 


Animportant feature of extant timetrees is the lineages-through-time 
curve (LTT), which counts the number of lineages at each time in the 
past that are represented by at least one sampled extant descending 
species in the tree. The likelihood of a tree under a given birth-death 
model, the LLT of the tree and the LTT that would be expected under 
the modelare linked as follows. Any given combination of (potentially 
time-dependent) speciation and extinction rates (A and y, respectively) 
and the probability that an extant species will be included in the tree 
(‘sampling fraction’) (p) can be used to define a deterministic diver- 
sification process, in which the number of lineages through time no 
longer varies stochastically but instead according to a set of differ- 
ential equations®” (Supplementary Information section S.1). Given 
a number of extant sampled species (M,), the LTT predicted by these 
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differential equations (the deterministic LTT (dLTT)) corresponds to 
the LTT that is expected for trees generated by the original stochastic 
model! The likelihood of a tree under a given birth-death model 
can be written purely in terms of the LTT of the tree and the dLTT of 
the model (see Supplementary Information section S.1.2 and ref. ? 
for derivations). This means that any two models with the same dLTT 
(conditioned on M,) yield identical likelihoods for the tree. We therefore 
call two models ‘congruent’ if they have the same dLTT for any given 
M,. Any two models are either congruent or non-congruent, regard- 
less of any particular data considered (Supplementary Information 
section S.1). The probability distribution of tree sizes generated by a 
model, when conditioned on the age of the stem or crown, is identical 
among congruent models (Supplementary Information section S.1.7). 
Hence, congruent models have equal probabilities of generating any 
given timetree, analogous to how congruent geometric objects exhibit 
similar properties (discussion in Supplementary Information section 
S.1.8). Although the mathematical relationship between the dLTT ofa 
modeland its likelihood has been known”, its implications for macro- 
evolutionary inference have remained unexamined and—as we show 
below—severely underestimated. 


The breadth of congruent model sets 


When seenasarandom variable, extant timetrees have the same prob- 
ability distribution under any two congruent models. Therefore, 
in the absence of further information, congruent models cannot 
possibly be distinguished solely on the basis of extant timetrees— 
neither through the likelihood nor any other test statistic. For any 
birth-death model, this leads to four important unresolved ques- 
tions: how many alternative congruent models there are, how dif- 
ferent these congruent models are from one another, how many of 
these congrument models correspond to plausible scenarios, and 
how these scenarios can be explored. To answer these questions, we 
present an alternative method for recognizing congruent models 
(full details are provided in Supplementary Information section S.1.1). 
Given a number of sampled species (M,), the dLTT of a model is fully 
determined by its relative slope (hereafter, pulled speciation rate), 
denoted by A, = -M"dM/dt (in which M is the dLTT, Tis time before 
present (or age) and pis a label (for ‘pulled’)). It can be shown that 
A, = AP, in which P(r) is the probability that a lineage extant at age T 
survives until the present day and is included in the timetree. In the 
absence of extinction (4 = 0) and under complete species sampling 
(p =1),A, is identical to A; however, in the presence of extinction A, is 
pulled downwards relative to / at older ages, whereas under incom- 
plete sampling A, is pulled downwards relative to A near the present. 
Because the dLTT of a model is fully determined by A, and vice versa, 
two models are congruent if and only if they have the same A, at all 
ages. Ina similar way, it can be shown that two models are congruent 
if and only if they have the same product pA, (in which A, = A(O)) and 
the same ‘pulled diversification rate’’, which is another composite 
variable and is defined as 


1d 


p= A-Ht+ a (1) 


The r, is equal to the net diversification rate (r=A — 1) whenever A is 
constant in time (dA/dt=0), but differs from rwhenA varies with time. 

We arenowready to examine the breadth of congruent model sets. We 
begin with a model with speciation rate A > O, extinction rate > 0 and 
sampling fraction p € (0, 1]. If we denote 7, = pA,, then for any alterna- 
tive chosen extinction rate function y* > 0 and any alternative assumed 
sampling fraction p* € (0, 1], there exists a speciation rate function A*>0 
such that the alternative model (A*, u* and p*) is congruent to the original 
model (A, wand p). In other words, regardless of the chosen p* and p*, 
we can find a hypothetical A* that satisfies A* - y* + (1/A*)dA*/dt=r, and 
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Fig. 1| Illustration of congruent birth-death processes (simulations). 
Example of four hypothetical congruent—yet markedly different—birth-death 
models. All models exhibit the same dLTT, and would yield the same likelihood 
for any given extant timetree. a, dLTT and deterministic diversities (N) 
predicted by the models, plotted over age (time before present). 

b-d, Speciation rates (A) (b), extinction rates (1) (c) and net diversification 
rates (r=A-1) (d) of the models. Myr, million years. For additional examples, 
see Extended Data Fig. 4. 


p*A*(0) = n,. Indeed, to construct such a A* one merely needs to solve 
the following differential equation: 
da* 


Gn Ge tB) (2) 


with initial condition A*(0) = 7,/p* (solution in Supplementary Informa- 
tionsectionS.1.4). The above observation implies that—starting from 
almost any birth-death model—we can generate an infinite number of 
alternative congruent models simply by modifying the extinction rate 
(u) and/or the assumed sampling fraction (9). Alternatively, congruent 
models can be constructed by assuming various ratios of 1/A (Sup- 
plementary Information section S.1.5). This set of congruent models 
(hereafter, the congruence class) is thus infinitely large. The congru- 
ence class can have an arbitrary number of dimensions (depending on 
restrictions imposed a priori on A* and p*), as u* could depend onan 
arbitrarily high number of free parameters. 

As an illustration of these principles, the simulations in Fig. 1 show 
four markedly distinct and yet congruent models (pulled rates are 
shown in Extended Data Fig. 1). The first scenario exhibits a constant 
Aandatemporary spike in p (that is, amass extinction event), the sec- 
ond scenario exhibits a constant and atemporary drop inA around 
the same time, the third scenario exhibits a mass extinction event ata 
completely different time and a fluctuating A, and the fourth scenario 
exhibits an exponentially decaying and a fluctuating J. These congru- 
ent scenarios were obtained simply by assuming alternative extinction 
rates, anda myriad of other congruent scenarios exist. Figure 2 shows 
a model with exponentially varying speciation and extinction rates, 
A=ae*' and p=ye, witha, B, yand 6 fitted to atimetree of 79,874 extant 
seed plant species“ via maximum-likelihood methods. Simply by modi- 
fying the coefficient 6 and choosing A according to equation (2), one 
can obtain a similarly complex scenario with opposite trends over time 
(Fig. 2b). Similar examples can also be generated using more realistic 
speciation and extinction rates, three of which can be seen in Extended 
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Fig. 2 | IIlustration of congruent birth-death processes (real data). Birth— 
death model with exponentially varying A and y, fitted to an extant timetree of 
79,874 seed plant species" over the past 100 Myr, compared toa congruent 
model obtained by simply modifying the exponential coefficient of u.a, LTT of 
the tree, compared tothe dLTT predicted by the two models. b, Speciation 


Data Fig. 2 (based on data from ref. °), Extended Data Fig. 3 (based on 
data from ref. °) and Extended Data Fig. 4. 

Such ambiguities have previously been observed in special cases 
For example, a previous study" recognized that a variable A and con- 
stant yz can be exchanged for a constant A and a variable 1 to produce 
the same dLTT. Other work on constant-rate birth-death models has 
revealed that alternative combinations of time-independent A, and 
pcan yield the same likelihood for a tree!”!*. Our work not only uni- 
fies these previous findings (which are all special cases of our general 
theory), but in fact reveals that vast (infinite-dimensional) expanses 
of model space are fundamentally indistinguishable even if p is known 
or all extant species have been sampled. 


11,16 


Implications 


Toestimate A and y, previous phylogenetic studies have imposed largely 
arbitrary constraints. For example, many studies assume that A or p vary 
exponentially through time”. However, this functional form is rarely 
justified biologically, and alternative functional forms of comparable 
simplicity and shape can be envisioned. Normally one expects that, 
with sufficient data, fitting any of these forms will lead to qualitatively 
similar trends and shapes. This expectation simply does not hold here, 
because the best-fitting representative within a given model set will 
generally only be the one closest to the congruence class of the true 
process, rather than closest to the true process itself (Fig. 3). Conse- 
quently, fitting alternative functional forms can result in markedly 
different inferences with alternative trends in A and yu, even if each 
functional form used is in principle adequate for approximating the 
true historical A and py (examples are shown in Extended Data Fig. 5, 
Supplementary Information section S.10). This conclusion applies 
to almost any model set used in practice, including models in whichA 
and y change at discrete time points”°. Because any given true diver- 
sification history (even a relatively simple one) is unlikely to exactly 
match the particular functional form considered, fitting the latter 
may not even approximately yield the true diversification history. The 
existence of congruent scenarios can thus seriously alter macroevo- 
lutionary conclusions—for example, when assessing the influence of 
environmental factors on diversification dynamics (example shown 
in Supplementary Information section S.4, and further discussion 
in Supplementary Information section S.5). Our findings thus shed 
doubt over previous work on diversification dynamics that is based 
solely on extant timetrees, including some of the conclusions from work 
that we have coauthored®”. Previous studies have underestimated this 
issue because they typically consider only a limited set of candidate 
models at atime, both when analysing real datasets and when assessing 
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rates (A) and extinction rates (1) of the two models. In each model, pis almost 
identical toA. The two models cannot possibly be distinguished using extant 
timetrees alone. For additional examples, see Extended Data Fig. 2and 
Supplementary Information sections S.10 and S.11. 


parameter identifiability via simulations; as a result, previous studies 
have been (un)lucky enough to not compare two models in the same 
congruence class (see Supplementary Information sections S.3 and S.7 
for reasoning). We stress that common model selection methods that 
are based on parsimony or ‘Occam’s razor’ (suchas the Akaike informa- 
tion criterion”) generally cannot resolve these issues (Extended Data 
Fig. 6, details in Supplementary Information sections S.2 and S.10). 


Ways forward 


Our findings are analogous to classic results from coalescent theory in 
population genetics, in which many alternative models can give rise to the 
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Fig. 3 | Conceptual implications. Conceptual illustration of the limited 
identifiability of a diversification process, assumed to be adequately described 
by some unknown birth-death model (red circle; hereafter ‘true process’). The 
congruence class of the true process is shown as asub-space comprising a 
continuum of alternative models (pink area). In practice, maximum-likelihood 
model selection is performed among a parameterized low-dimensional set of 
allowed models, the precise nature of which can vary from case to case (for 
example, depending on assumed functional forms for A and y, or the number of 
allowed rate shifts)*°. The two continuous lines shown here represent two 
alternative cases of allowed model sets (for example, considered intwo 
alternative studies), from within each the model closest to the truth is (ideally) 
sought. However, in each case likelihood-based model selection will converge 
towards the allowed model closest to the congruence class (blue and green 
filled circles), which in general is not the allowed model that is actually closest 
to the true process (white circles). This identifiability issue persists even for 
infinitely large datasets. 


same drift process as the idealized Wright-Fisher model”. This realiza- 
tion was particularly important for the field: it focused the attention of 
researchers onthe dynamics of the effective population size, an identifiable 
parameter, rather than on actual (but non-identifiable) historical demogra- 
phy. Similarly, congruent birth-death models can be defined in terms of A, 
or—equivalently—in terms ofr, and pA,, all of which areidentifiable provided 
sufficient data’. Each congruence class contains exactly one model with 
y= Oand p=1, whichis also the model in which A=A,; hence, the pulled 
speciation rate can be interpreted as the speciation rate that generates 
the dLTT of the congruence class in the absence of extinctions and under 
complete species sampling. In other words,A, can beseenas the ‘effective’ 
speciation rate that fully explains the shape of the LTT of thetree. Similarly, 
each congruence class contains models with time-independent, and for 
these models r, =r; therefore, the pulled diversification rate can be inter- 
preted as the effective net diversification rate if A was time-independent. 

Fossil data could help to resolve the ambiguities highlighted here”, 
and biological knowledge could, in principle, also help to reduce ambi- 
guities. For example, if and are somehow known from other sources, 
the congruence class collapses to a unique diversification scenario 
(Extended Data Fig. 7). Nevertheless, for many taxa the fossil record 
remains scarce and ambiguous, and our general understanding of what 
constitutes a plausible diversification scenario is poorly developed. 
Rather than attempting to estimate A and y, one canestimateA,,r,and 
pA, (and, if p is known)—for example, by using likelihood methods” 
(Extended Data Fig. 8, Supplementary Information section S.9). Previ- 
ous work” has shown that r, can indeed yield insight into diversification 
dynamics and help to detect major transitions over time (Supplemen- 
tary Information section S.8), as changes in r, necessarily imply changes 
inAand/or yp. ThroughA,, it also becomes possible to simulate and ana- 
lyse diversification models with substantially simplified mathematical 
tools, as any model is congruent to a model with speciation rate A,, zero 
extinction and complete species sampling”. Reciprocally, many exist- 
ing estimation tools can be used to estimate A, and r, by constraining 
ito be zero or Ato be time-independent, respectively. Depending on 
the situation, other invariants of congruence classes may also have 
advantages, such as the ‘coalescent density’ introduced by ref. ”, which 
permits an elegant description of the distribution of branching ages 
(see Supplementary Information section S.8 for further details). 


Conclusions 


Without further information or biologically well-justified constraints, 
in general extant timetrees alone cannot be used to reliably infer 
speciation rates (except for the present day), extinction rates or net 
diversification rates. Consequently, correlations between, or rand 
fluctuating environmental factors (such as temperature) also cannot 
be reliably inferred, neither when A, yor rare first estimated and then 
related to the environmental factors nor if A, 4 and rare expressed as 
parameterized functions of the environmental factors and then fitted 
to the timetree (Supplementary Information section S.5), because dif- 
ferent parameterizations can lead to completely different inferences. 
Our findings could explain why diversification dynamics observed in 
the fossil record often contradict inferences based on phylogenet- 
ics’?>°° although other explanations have also been proposed”*”’. It 
is possible that similar major identifiability issues may also be hiding 
in other evolutionary reconstruction methods based on extant organ- 
isms alone, but this remains to be examined. Ona more positive note, 
we have resolved a long-standing debate and precisely clarified what 
informationcan be extracted from extant timetrees alone, formulated 
in terms of easily interpretable and identifiable variables. 
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Methods 


No statistical methods were used to predetermine sample size. Thor- 
ough mathematical derivations and computational details are provided 
in Supplementary Information sections S.1-S.11. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| Pulled speciation and diversification rates. a, b, Pulled speciation rate (a) and pulled diversification rate (b) of the four congruent models 
shownin Fig. 1. 
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Extended Data Fig. 2 | IIlustration of congruent birth-death processes 
(fossil data). a, Origination and extinction rates of marine invertebrate genera, 
estimated from fossil data. b, Congruent scenario to that ina, obtained by 
reversing the linear trend of y (thatis, fitting a linear curve to the original uz, and 
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then subtracting that curve twice) and adjusting A according to equation (2).c, 
Congruent scenario to that ina, assuming an extinction rate of zero. Further 
details are provided in Supplementary Information section S.10. 
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Extended Data Fig. 3 | Previous studies are likely to have over-interpreted 
phylogenetic data. Time-dependent birth-death model fitted toanearly 
complete extant timetree of the Cetacea, under the assumption of extinction 
rates of zero (u=0), compared toa congruent model in which the extinction 
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rate is close to the speciation rate (u=0.9A).a, LTT of the tree, compared tothe 
dLTT predicted by the two models. b, Speciation rates (A) and extinction rates 
(w) of the two models. c, Net diversification rates (r=A— 1) of the two models. 
Further details are provided in Supplementary Information sectionS.4. 
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Extended Data Fig. 4| Additional examples of congruent birth-death 
processes. a-c, Example of two congruent—yet markedly different—birth- 
death models. Both models exhibit atemporary spike in the extinction rate and 
atemporary spike in the speciation rate; however, the timings of these events 
differ substantially between the two models. Both models exhibit the same 
dLTT and the same pulled diversification rate (r,) and would yield identical 
likelihoods for any given extant timetree. a, Speciation rates (A and A*) and 
extinction rates (“and y*) of the two models, plotted over time. Continuous 


curves correspond to the first model, and dashed curves correspond tothe 
second model. b, Net diversification rates (rand r*) and pulled diversification 
rate (r,) of the two models. c, dLTT and deterministic total diversities (Vand N*) 
predicted by the two models. d-f, Another example of two congruent models. 
In the first model, the speciation and extinction rates both decrease 
exponentially over time, whereas in the second model the extinction rate 
increases exponentially over time and the speciation rate exhibits variable 
directions of change over time. In all models, the sampling fractionis p=0.5. 
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Extended Data Fig. 5 | Identifiability issues persist in large trees. a-—c, 
Diversification analysis of a timetree (about 114,000 tips) simulated froma 
birth—death process that exhibits a mass extinction event at around 5 Myr 
before present. a, LTT of the generated tree (long-dashed curve), dLTT of the 
true model that generated the tree (continuous curve) and dLTT of amaximum- 
likelihood fitted model (short-dashed curve) are shown. The fitted dLTT is 
practically identical to the true dLTT and thus is covered by the latter. b, True 
speciation and extinction rates (continuous curves), compared to fitted 
speciation and extinction rates (dashed curves). There is considerable 
disagreement between the fitted and true/ and y, despite the fact that the 
allowed model set could—in principle—approximate the true rates reasonably 
well.c, Pulled diversification rate (PDR) of the true model (continuous curve), 
compared tothe pulled diversification rate of the fitted model (dashed curve). 
d-f, Diversification analysis of atimetree (about 785,000 tips) simulated from 


a birth-death process that exhibits a rapid radiation event at around 5 Myr 
before present and amass extinction event at around 2 Myr before present. d-f 
are analogous to a-c. Thereis considerable disagreement between the fitted 
andtrueA andy, despite the fact that the allowed model set could—in 
principle—approximate the true rates reasonably well. Extended Data Figure 7 
provides the fitting results when pis fixed to its true value. g-i, Diversification 
analyses of an extant timetree of 79,874 seed plant species, performed either 
by fitting A andy ona grid of discrete time points or by fitting the parameters of 
generic polynomial or exponential functions forA andy. g, LTT of the tree, 
dLTT of the grid-fitted model and dLTT of the fitted parametric model. h, 
Speciation and extinction rates predicted by the grid-fitted model or the fitted 
parametric model. i, Pulled diversification rate predicted by the grid-fitted 
modeland the fitted parametric model. Further details are provided 

in Supplementary Information sections S.10 andS.11. 
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Extended Data Fig. 6 | Identifiability issues cannot be resolved with 

the Akaike information criterion. Maximum-likelihood birth-death models 
fitted toa tree comprising 1,000,000 tips, simulated on the basis of the 
origination and extinction rates of marine invertebrate genera estimated from 
fossil data. Top row, maximum-likelihood-fitted piecewise constant model 
(also knownas birth—-death-shift model), with grid size (n=11) chosen by 
minimizing the Akaike information criterion (AIC). Bottom row, maximum- 
likelihood-fitted piecewise linear model, with grid size (n=12) chosen by 
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minimizing the AIC. Left column, dLTTs of the fitted models compared tothe 
true dLTT andtheLTT of thetree. Right column, fitted speciation and 
extinction rates, compared to the true rates used to generate the tree. Inboth 
cases, the maximum-likelihood models poorly reflect the true rates despitea 
near-perfect match of the LTT, even when the complexity of the models was 
optimized on the basis of the AIC. For further details, see Supplementary 
Information sections S.2 and S.10. 
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Extended Data Fig. 7 | Estimating A when p and pare fixed or known. a-c, 
Example analysis of a simulated extant timetree (about 114,000 tips) that 
exhibits a mass extinction event at around 5 Myr before present. A birth-death 
model was fitted while fixing 1 and p to their true values; A was fitted at 

15 discrete time points. a, LTT of the generated tree (long-dashed curve), dLTT 
of the true model that generated the tree (continuous curve) and dLTT ofa 
maximum-likelihood fitted model (short-dashed curve). The fitted dLTT is 
practically identical to the true dLTT, and is thus covered by the latter. b, True 
speciation and extinction rates (continuous curves), along with the fitted 
speciation rate and fixed extinction rate (dashed curves). c, Pulled 
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diversification rate of the true model (r,, continuous curve), compared to the 
pulled diversification rate of the fitted model (dashed curve). d-f, Example 
analysis of asimulated extant timetree (about 785,000 tips) that exhibits a 
rapid radiation event at about 5 Myr before present and a mass extinction event 
at about 2 Myr before present. A birth-death model was fitted similarly to the 
example shown ina-c, and d-fare analogous to a-c. In both cases, rate 
estimation was restricted to ages at which the LTT included at least 

500 lineages. Further details are provided in Supplementary Information 
sectionS.10. 
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Extended Data Fig. 8 | Fitting congruence classes instead of models. 
Analysis of an extant timetree generated by a birth-death model that exhibits a 
temporary rapid radiation event about 5 Myr before present and amass 
extinction event about 2 Myr before present. A congruence class was fitted to 
the timetree either in terms of the pulled diversification rate (r,) and the 
product pA,, or in terms of the pulled speciation rate (PSR) (A,).a, LTT of the 


tree (long-dashed curve), together with the dLTT of the true model (continuous 


curve) and the dLTT of the fitted congruence classes (short-dashed curve); in 
bothcases, the fitted dLTT was almost identical to the true dLTT, andis thus 
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completely covered by the latter. b, Pulled diversification rate of the true 
model (continuous curve), compared to the fitted pulled diversification rate 
(short-dashed curve). c, Pulled speciation rate of the true model (continuous 
curve), compared to the fitted pulled speciation rate (short-dashed curve). The 
pulled diversification rate and pulled speciation rate were fitted viamaximum- 
likelihood methods, allowing the pulled diversification rate or pulled 
speciation rate to vary freely at 15 discrete equidistant time points. Further 
details are provided inin Supplementary Information sectionS.9. 
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Study description We use mathematical proofs, numerical simulations and analysis of previously published phylogenetic data to investigate the 
identifiability of past diversification dynamics from extant time-calibrated phylogenies. 


Research sample The following previously published datasets were used: 
Origination and extinction rates of marine invertebrate genera, estimated from the fossil record by (Alroy et al. 2008). 
Time-calibrated phylogeny of 79874 extant seed plant species, published by Smith and Brown (2018). 
Time-calibrated phylogeny of the Cetacea, published by Steeman et al. (2009). 


Sampling strategy Sample sizes (i.e., phylogeny sizes) were chosen as large as possible in our examples, to demonstrate that the identifiability issues 
discussed in the paper persist even for massive datasets. For simulated trees we used very large numbers of tips (larger than 
commonly seen in the literature), again to illustrate that our conclusions remain valid even for massive data sets. 

Data collection No new data were collected. 

Timing and spatial scale No new data were collected. 


Data exclusions No data was excluded from the analysis. 


Reproducibility Our full mathematical proofs and numerical procedures are described in detail in the manuscript and supplemental material. All new 
simulation code is published through the R package "castor" (version 1.5.5), which is freely available on CRAN. 
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Blinding Blinding was not relevant to our study, as no experiments were performed. 
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Pottery is one of the most commonly recovered artefacts from archaeological sites. 
Despite more than a century of relative dating based on typology and seriation’, 
accurate dating of pottery using the radiocarbon dating method has proven 
extremely challenging owing to the limited survival of organic temper and 
unreliability of visible residues” *. Here we report a method to directly date 
archaeological pottery based on accelerator mass spectrometry analysis of “Cin 
absorbed food residues using palmitic (C,,..) and stearic (C,¢.,) fatty acids purified by 
preparative gas chromatography° ®. We present accurate compound-specific 
radiocarbon determinations of lipids extracted from pottery vessels, which were 


rigorously evaluated by comparison with dendrochronological dates 


°10 and inclusion 


insite and regional chronologies that contained previously determined radiocarbon 
dates on other materials” ». Notably, the compound-specific dates from each of the 
Cye.0 and Cj¢.. fatty acids in pottery vessels provide an internal quality control of the 
results° and are entirely compatible with dates for other commonly dated materials. 
Accurate radiocarbon dating of pottery vessels can reveal: (1) the period of use of 
pottery; (2) the antiquity of organic residues, including when specific foodstuffs were 
exploited; (3) the chronology of sites in the absence of traditionally datable materials; 
and (4) direct verification of pottery typochronologies. Here we used the method to 
date the exploitation of dairy and carcass products in Neolithic vessels from Britain, 
Anatolia, central and western Europe, and Saharan Africa. 


Chronology lies at the heart of archaeology’’. Radiocarbon dating by 
accelerator mass spectrometry (AMS) is the most widely used method 
for providing calendrical chronologies for human activities over the 
past 50,000 years”, and is most commonly performed on samples 
of charred plant remains and bone”. Radiocarbon dates can be used 
alongside relative sequences, such as those derived from stratigra- 
phy or the typological analysis or seriation of artefact types, to build 
chronological models. Applying Bayes’ theorem enables radiocarbon 
dating to provide calendar age estimates with uncertainties as low as 
a few decades”. 

The invention of pottery in the late Pleistocene epoch was probably 
acritical driver for developments in food processing’®””. Pottery ves- 
sels can often be placed in robust relative chronological sequences 
using typology and seriation, although obtaining precise and accurate 


radiocarbon dates from pottery is challenging”?”. All sources of carbon 
associated with pottery vessels have been considered for dating” *, 
including organic temper, which occasionally survives firing, and surfi- 
cial food crusts, although these are rare and prone to contamination 
owing to their exposed nature”. By contrast, the lipidic components of 
food residues absorbed into—and protected by—the clay matrix during 
cooking occur very commonly’, often in high concentrations (milli- 
grams per gram of clay fabric). These offer an untapped resource for 
radiocarbon dating. The most common absorbed residues correspond 
to degraded animal fats characterized by their high abundances of C,,., 
and C,g,. fatty acids”*. The possibility of using preparative capillary gas 
chromatography (pcGC) to isolate chemically pure fatty acids from 
such residues for compound-specific radiocarbon analysis (CSRA) was 
recognized more than 20 years ago”?4, Although initial attempts 
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Fig. 1| Site location map, partial gas chromatograms and stable isotope 
determination of compound-specific radiocarbon-dated lipid residues 
preserved in Neolithic pottery vessels. a, Map of the location of the 
archaeological sites for which CSRA was used in this study. Scale bar, 1.000 km. 
CUI, Cuiry-lés-Chaudardes; ENS, Ensisheim; GEL, Geleen-Janskamperveld; 
KAR, Karwowo 1; KON, K6nigshoven 14; LDW, Ludwinowo 7; PPL, Principal 


to date pottery vessels were promising, the accuracy and precision 
demanded by archaeology could not be achieved owing to unidentified 
technical difficulties, leading to highly variable results”. 

We have brought together the latest technologies for radiocarbon 
measurements, including automated graphitization and MICADAS com- 
pact AMS, in conjunction with high-field 700-MHz NMR, to undertake 
systematic investigations of the pcGC protocol**. Rigorous assessment 
of contamination in compounds purified by pcGC was undertaken, 
leading to our invention of a solventless pcGC trap and implementa- 
tion of cleaning procedures to avoid between-run carryover*®. These 
advances reduce the exogenous contamination of fatty acids that has 
previously been associated with pcGC to below concentrations that 
would significantly affect measured radiocarbon ages. For archaeologi- 
cal animal fats, it has previously been demonstrated that two fatty acids 
isolated from the same matrix generate the same radiocarbon age (that 
is, statistically consistent at the 95% significance level), providing an 
internal quality control for archaeological dating®. In this study, we aim 
to extend this method to archaeological pot lipids. We selected pottery 
vessels that were richin animal fats from our database of lipid residues 
that we accumulated over the last three decades. Pottery vessels from 
chronologically well-characterized settings and different burial envi- 
ronments were analysed and the compatibility of pot lipid dates with 
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Place, London; ROS, Rosheim; SW, Sweet Track; TAK, Takarkori; TP, Catalhoéyiik 
East. b, Partial gas chromatograms of aselection of potsherds showing C,,.. and 
Cig.o fatty acid abundances. ¢, Scatter plots of ABC (= 8? Cig-9 — 8 °Cye.9) Values 
plotted against 6°C,,.. values (mean of 2 measurements) for all of the sherds 
dated (n=31), ranges on the left denote the mean +1s.d. of modern reference 
fats, as reportedinref.°. 


these existing chronologies was evaluated by statistical comparison 
of posterior density estimates for the key parameters and the use of 
indices of agreement with inclusion in these known frameworks (Fig. 1, 
Extended Data Table 1 and Supplementary Information 1). 

Weinitially focused on Neolithic Carinated Bowl pottery from the Sweet 
Track (Fig. 2a), an elevated wooden trackway discoveredina wetland area 
of the Somerset Levels””°”* in the United Kingdom (Supplementary Infor- 
mation 2). This site is critical because its construction has been precisely 
dated by dendrochronology to the winter-early spring of 3807-3806 BC 
and the trackway was used and maintained for approximately 10 years”’. 
Lipids from pots that were found alongside the trackway, and were prob- 
ably contemporaneous to its construction and use, have previously been 
dated, but the measured dates wereacentury later than the construction 
of the trackway”. Re-analysis of the two vessels (Fig. 2b) using our new 
approach produced uncalibrated radiocarbon ages of 5,110 + 25 years 
before present (BP; taken as AD 1950) (SW1) and 5,092 +26 BP (SW2), which 
are statistically indistinguishable (7 =9.0, 7(5%) =9.5, v=4) from the meas- 
urements of the tree rings included in the IntCal13 calibration curve for the 
relevant decade” (Fig. 2c). The calibrated dates of these ages are clearly 
compatible with the tree-ring dates for the construction of the trackway. 

Extending our approach to Anatolia, the Neolithic tell of Catal- 
hoéyiik East was a locus for the emergence and development of pottery 
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Fig. 2 | Sweet Track timbers, a pottery vessel and calibrated radiocarbon 
dates. a, Photograph of Sweet Track timbers. b, Photograph of a Carinated 
Bowl (SW2) that was recovered alongside the Sweet Track. Scale bar,5cm.c, 
Probability distributions of dates from pots deposited next to the Sweet Track 
(green) and from oak trees (black) included in IntCal13* that include the date of 
the Sweet Track construction in 3807-3806 Bc. Each distribution represents 
the relative probability that an event occurs at a particular time. For each of the 
dates, two distributions have been plotted: one in outline, which is the simple 
radiocarbon calibration, anda solid distribution, based on the model used. The 
square bracket down the left side along with the OxCal keywords define the 
overall model exactly (provided inSupplementary Information 2).A,A.ompand 
A, are the individual agreement indices, the combination agreement indices 
and the acceptable threshold to combine n radiocarbon dates, respectively. 
The photographs were provided by S.M. and are reproduced with permission 
fromthe Somerset Levels Project. 


production. A 21-m-deep stratigraphic sequence provides strong archae- 
ological prior information for a Bayesian chronological model that covers 
the upper parts of the mound (TP area)". The sequence of houses, mid- 
dens and burial structures has been combined with 50 radiocarbon dates, 
revealing a Neolithic sequence of occupation from the mid-sixty-fourth 
tothe mid-sixtieth centuries calibrated (cal.) Bc". Our compound-specific 
radiocarbon ages on adipose lipids” from four pottery vessels from four 
different contexts (TP.M17, 7,382 + 31 BP; TP.N10, 7,348 + 25 BP; TP.023, 
7,340 +27 BP; and TP.P13, 7,364 +25 BP) were incorporated into the Bayes- 
ian chronological model for this part of the site (Extended Data Figs. 1,2 
and Supplementary Information 3). The revised model for the Neolithic 
deposits inthe TP area shows posterior distributions for the key param- 
eters that are almost identical to those from the original model”. Their 
median values vary by an average of 4 years anda maximum of 10 years, 
confirming the compatibility of the radiocarbon ages determined using 
fatty acids with the site stratigraphy. On the basis of sensitivity analyses 
(Supplementary Information 3), this well-constrained model is at least 
as sensitive as measurements on paired materials to detect inaccura- 
cies. In this case, the CSRA dates not only provide direct dating for the 
importance of ruminant carcass products (Fig. 1c) to the inhabitants of 
Catalhéyiik at this time (derived from 6“C values of preserved fats), but 
also provide direct dating evidence for the climatic changes associated 
with the global event of 8.2thousand years ago (derived from compound- 
specific deuterium isotope analyses using the same fats)””. 
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The next analysis tests the accuracy of our dating approach using 
aclassic pottery seriation study related to Neolithic ceramics from 
Lower Alsace (France) that spans the second quarter of the fifth mil- 
lennium cal. Bc” (Supplementary Information 4). The regional corre- 
spondence analysis clearly separates the Hinkelstein, Grossgartach, 
Planig-Friedberg and Roéssen Middle Neolithic ceramic groups. We 
focused on vessels from three pits, all of which can be assigned to the 
Grossgartach phase (Fig. 3a, b). The sequence of ceramic phases was 
combined with the existing assemblage of 95 radiocarbon dates, which 
were largely measured on articulated bones, along with four CSRA dates 
on fatty acids (ROS-C-4596, 5,804 +25 BP; ROS-C-4600, 5,904 + 28 BP; 
ROS-C-4644, 5,931 + 26 BP; and ROS-C-4657, 5,912 + 28 BP) from the 
Grossgartach sherds in a model using Bayesian statistics. The phase 
boundaries in this revised model are very similar to those produced by 
the original analysis”, as median values differ by an average of 6 years 
and a maximum of 15 years (Fig. 3c). The sensitivity analyses (Supple- 
mentary Information 4) demonstrate that the model is particularly 
sensitive to small biases, and probably more sensitive than measure- 
ments on paired materials. The CSRA dates are clearly compatible with 
the attribution of these pottery vessels to the Grossgartach ceramic 
phase based on their decorative motifs, and with the other radiocarbon 
dates for this group. 

We then explored the introduction of a new food product—that is, 
milk—into Neolithic Europe by undertaking radiocarbon dating of 
animal fat residues, including dairy fats, that were recovered from early 
farming settlements with Linearbandkeramik (LBK) pottery (Fig. 1). 
These communities settled in central Europe from the early fifty-fourth 
century BC’. Animal fats in 12 potsherds from the earliest LBK contexts 
at 6 sites, in Poland, France, Germany and the Netherlands, produced 
radiocarbon dates that were modelled and shown to be compatible 
with the currency of LBK ceramics in northern and western Europe” 
(Extended Data Figs. 3, 4 and Supplementary Information 5). Sensitiv- 
ity analyses (Extended Data Fig. 4 and Supplementary Information 5) 
demonstrate that this model is more sensitive to older biases as we 
focused onearly settlements, illustrating the direct dating of anew food 
commodity. The radiocarbon dates on the earliest dairying residues 
suggest that the practice began in 5385-5225 cal. Bc (95% probability; 
start LBK lipid; Extended Data Fig. 3) and probably arrived with the 
earliest farmers in these areas. Thus, the linking of fatty acid struc- 
tures with compound-specific carbon isotope values and CSRA dates 
provides a powerful means of directly dating prehistoric foodways 
and their introduction. 

We next investigated pottery from the Sahara Desert to provide atest 
of the methodology for a region in which depositional conditions are 
very different from the temperate climes of northern Europe. The Takar- 
korirock shelter, located in the now hyper-arid area of the Acacus Moun- 
tains, southwest Libya, demonstrates evidence of animal exploitation 
based on rock art and archaeological finds“ (Extended Data Figs. 5, 6). 
Previous work revealed abundant adipose and dairy fat residues in frag- 
ments of the pottery vessels”. Stratigraphy and radiocarbon dating of 
arange of materials (bone collagen, charred plant remains, dung, skin 
and enamel bioapatite) placed deposits associated with Middle Pastoral 
pottery in the sixth-fifth millennia cal. Bc’**8”’. The fatty acids from 
5 potsherds, containing dairy fat (Extended Data Fig. 6b), produced 
uncalibrated radiocarbon ages of 5,993 + 28 BP (TAK443), 5,979 + 28 BP 
(TAK120), 5,493 +28 BP (TAK420), 5,348 + 24 BP (TAK21) and 5,085 + 24 BP 
(TAK1572). The CSRA dates were proven to be entirely compatible with 
the currency of Middle Pastoral Neolithic ceramics (Extended Data 
Fig. 6d and Supplementary Information 6), and the direct radiocarbon 
dating of dairy residues confirms that dairying in North Africa began as 
early as the end of the sixth millennium cal. Bc*”®>”’, Although the model 
sensitivity is weak based onthe small number of reference dates that it 
includes (Extended Data Fig. 7 and Supplementary Information 6), it 
demonstrates the possibility of dating potsherds from extremely arid 
burial conditions. In addition, direct dating of pottery lipids represents 
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Fig. 3 | Drawings, correspondence analysis and radiocarbon dates of 
Neolithic vessels from Alsace (France) modelled using Bayesian statistics. 
a, Drawings of decorated pottery vessels from the Grossgartach group from 
pits 122 (1, 2),50 (4,5) and 63 (3, 6, 7) from which the undecorated potsherds 
dated in the model were recovered. Scale bar, 5cm.b, Revised correspondence 


a major contribution to ascertain the correct cultural attribution of 
materials found in loose sediments (organic sands), which are typi- 
cal of desert environments and frequently found in highly disturbed 
sequences”, 

Finally, archaeological excavations of several pits by the Museum of 
London Archaeology in advance of building works at Principal Place, 
London (PPL11) revealed one of the largest assemblages of Neolithic 
pottery recovered so far from the City of London or its immediate envi- 
rons. Notably, the only finds other than pottery recovered from this 
deposit (lithics, bones and charred plant remains) were compromised 
by later disturbance and truncation. The assemblage comprised Neo- 
lithic plain and decorated bowls, consisting of thin-walled medium- 
sized open/neutral bowls, together with several smaller open bowls/ 
cups (J.C. etal., manuscriptin preparation). Similar material has been 
found elsewhere in the Thames Valley and beyond. Lipid-residue analy- 
ses revealed high concentrations of degraded animal fats in several 
sherds, which were shown by compound-specific 6“C values to derive 
from dairy and adipose fats (Fig. 1c). Radiocarbon measurements 
of fatty acids from four plain sherds yielded uncalibrated ages of 
4,911+ 27 Bp (PPLO12), 4,742 + 22 BP (PPLO15), 4,652 + 26 BP (PPLO20) 
and 4,733 + 22 BP (PPLO21). A statistical model confirms that the pottery 
dates fit well within the currency of Plain Bowls in southern Britain 
(Extended Data Fig. 8 and Supplementary Information 7). The sensitiv- 
ity analyses (Extended Data Fig. 9 and Supplementary Information 7) 
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analysis on the cultural assemblages (axis 1) and horizontal decorative motifs 
(axis 2), including features that contained the dated sherds from the Alsatian 
Neolithic groups. c, Revised statistical model of the Middle Neolithic with 
radiocarbon dates on pot lipids included in grey. The data are shown as 
described in Fig. 2c. Amodei is the model agreement index. 


are weaker in this case, but demonstrates the value of dating absorbed 
lipid residues in situations in which no other datable material exists. 
Our ability to undertake accurate radiocarbon dating of compound- 
specific fatty acids from pottery was invaluable in affording atemporal 
insight into some of the earliest traces of human activity in what is 
now the City of London. 

In summary, the radiocarbon determinations of lipid residues from 
Neolithic pottery vessels presented above, modelled against site chro- 
nologies, establish the CSRA of fatty acids as a robust tool for archaeologi- 
cal dating. Importantly, our method and findings bring pottery vessels 
within the range of other archaeological materials that are routinely 
dated by radiocarbon. The importance of this advance to the archaeo- 
logical community cannot be overstated. Pottery typology is the most 
widely used dating technique in the discipline, and thus the opportunity 
toanchor different kinds of pottery securely to the calendrical timescale 
willbe of utmost practical importance. Notably, pottery often survivesin 
circumstances in which other organic materials often do not and, there- 
fore, archaeological questions relating to chronology that are currently 
intractable will come within the scope of our technologies. 
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Methods 


Lipid extraction and isolation 

Potsherds were selected on the basis of the presence of terrestrial 
animal fats (dairy and ruminant carcass fats) in the lipid residue to 
avoid any possible reservoir effect caused by the processing of aquatic 
products in pots. A piece of 1-10 g of the potsherd was sampled, accord- 
ing to the lipid concentration. The sherds were extracted ina glass 
culture tube using H,SO,/MeOH (4% v/v, 3x 8 ml, 70 °C, 1h). The super- 
natants were centrifuged (2,500 rpm, 10 min) and combined into new 
culture tubes containing double-distilled water (5 ml). The lipids were 
extracted with n-hexane (4x 5 ml), transferred into 3.5-ml vials and 
blown to dryness at room temperature under a gentle nitrogen stream. 
Subsequently, around 180 pl of n-hexane was added to obtain a con- 
centration of fatty acid methyl esters (FAMEs) at 5 pg of C pl before 
transfer to an autosampler vial for isolation by pcGC. 

The pcGC consisted of a Hewlett Packard 5890 series II gas chro- 
matograph coupled to a Gerstel Preparative Fraction Collector bya 
heated transfer line. The pcGC was equipped witha column with a100% 
poly(dimethylsiloxane) stationary phase (Rxi-Ims, 30 m x 0.53 mm 
inner diameter, 1.5 pm film thickness, Restek). Helium was used as 
the carrier gas at a constant pressure of 10 psi. The GC temperature 
programme started with an isothermal hold at 50 °C for 2 min, the tem- 
perature was increased to 200 °C at 40 °C min”, to 270 °C at 10 °C min™ 
and finally increased to 300 °C at 20 °C min“ and held for 8.75 min. 
The Cy¢.o and C,s.9 FAMEs were injected (1 pl per run), separated and 
trapped 40 times per trapping sequence. Of the GC column effluent, 1% 
flows to the flame ionization detector, while the remaining 99% passes 
through a transfer line into the fraction collector, both of which were 
heated to 300 °C. Compounds were isolated based on their retention 
times®. The stationary phase degradation of the pcGC column and other 
sources of exogenous carbonaceous contamination were monitored 
ona Brucker Avance III HD 700 MHz NMR instrument following a previ- 
ously published procedure**. 


Radiocarbon determinations and statistical analysis 

The pcGC isolated compounds were transferred into Al capsules, after 
which they were combusted and graphitized in a Vario Microcube Ele- 
mental Analyser linked to an Automated Graphitisation System (AGE 3, 
lonPlus). All ofthe radiocarbon measurements were performed by the 
Bristol Accelerator Mass Spectrometer (BRAMS) facility at the Univer- 
sity of Bristol. Data reduction was performed using the software BATS” 
(v.4.07). Radiocarbon dates obtained for FAMEs were corrected for the 
presence of added methyl carbon using a mass balance approach**” 
and reported as the conventional radiocarbon ages™ (Supplementary 
Information 1). 

Two contemporaneous compounds (C,,., and Cy¢.. fatty acids) were 
dated and every pair of statistically indistinguishable measurements 
(at the 95% significance level)” was combined as a weighted average 
before Bayesian chronological modelling using OxCal v.4.2 and v.4.3°8 
and the currently internationally agreed radiocarbon calibration curve 
for the Northern Hemisphere, IntCal137°. The compatibility of the 
radiocarbon dates on absorbed fatty residues with existing sites and 
regional chronologies was assessed by including the lipid radiocar- 
bon dates into existing statistical frameworks ina position defined by 
archaeological information (for example, stratigraphy or seriation). 
Their compatibility with the existing chronologies were achieved by: (1) 
comparison of posterior density estimates for key modelled parameters 
with equivalent date estimates or known age by dendrochronology; 
(2) using the individual and model agreement indices'*” in models 
containing fatty acid dates; and (3) comparing posterior density esti- 
mates for key parameters from models that include the fatty acid dates 
to a model that does not include the fatty acid dates (Supplementary 


Information 1). The sensitivity of existing chronological models to 
the addition of the new radiocarbon measurements was evaluated as 
above, after deliberately biasing the radiocarbon dates on pottery ves- 
sels to varying degrees while assessing the effect on posterior density 
estimates for the key parameters and indices of agreements (Supple- 
mentary Information 1-7). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Data Figs. 1-9, Extended Data Table 1 and Supplementary Information. 
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Extended Data Fig. 1| Schematic showing the stratigraphic information of the Neolithic occupation of the TP area at Catalhéyiik (Turkey). This information 
was included in the chronological model defined in Extended Data Fig. 2. Contexts containing potsherds dated in this study are highlighted in green. 
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OxCal v4.2.4 Bronk Ramsey (2013); r:1 IntCal13 atmospheric curve (Reimer et a/, 2013) 
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Extended Data Fig. 5| The Tadrart Acacus Mountains in southwest Libya. a, b, The Wadi Takarkori area (dashed rectangle). c, Schematic plan of the excavated 
areas. Allsampled sherds come from the main sector. 


d OxCal v4.2.4 Bronk Ramsey (2013); r:1 IntCal13 atmospheric curve (Reimer et a/_ 2013) 
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Extended Data Fig. 6| Site stratigraphy, photographs of potsherds and 
radiocarbon dates of Middle Pastoral pottery vessels from Takarkori 
(Libya) modelled using Bayesian statistics. a, Stratigraphic context of 
sampled potsherds from Takarkori east-west profile of the southern wall of the 
Takarkori north-south (Extended Data Fig. 5). (a) aeolian sand; (b) sand richin 


organic matter; (c) lenses of undecomposed plant remains; (d) ash; (e) charcoal; 


(f) slurry deposit; (g) eroded sand from the wall; (h) bedrock. b, Photographs of 
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the five potsherds analysed showing typical Middle Pastoral decorative 
patterns.c, Example of temporally and spatially wide deposit of organic sands 
(detail of layer 25, Takarkori main sector). d, Statistical model of the Middle 
Pastoral period showing the comparison of pot lipid dates (in green) with 
previous radiocarbon measurements. Data are shownas described for 
Extended Data Fig. 2. 
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OxCal v4.3.2 Bronk Ramsey (2017); r:5 IntCal13 atmospheric curve (Reimer et al 2013) 
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Extended Data Fig. 7 | Sensitivity analyses of radiocarbon dates on vessels the Extended Data Fig. 6d and models including fatty acid dates that were 
from Takarkorirock shelter, Libya. Probability distributions for the deliberately biased by 1o, 20, 40, 80, 200and 400. Data are shownas described 
beginning and end Middle Pastoral period ceramics from Takarkorirock for Extended Data Fig. 4. 

shelter, Libya (no pot lipid dates) compared with those of the model shown in 


OxCal v4.2.4 Bronk Ramsey (2013); r:1_IntCal13 atmospheric curve (Reimer ef al_ 2013 
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Extended Data Fig. 8 | Probability distributions of dates associated with the use of early Neolithic Plain Bowl pottery in southern Britain. Prior 
distributions have been taken from the models described in the text and in the Supplementary Information. Data are shownas described for Extended Data Fig. 2. 
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OxCal v4.3.2 Bronk Ramsey (2017); r:1 IntCal13 atmospheric curve (Reimer et al 2013) 
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Extended Data Fig. 9 | Sensitivity analyses of radiocarbon dates on vessels the model shown in Extended Data Fig. 8 and models including fatty acid dates 


from Principal Place, London. Probability distributions ofthestartandendof that were deliberately biased by 20, 40, 80 and 160. Data are shownas described 
early Neolithic Plain Bowl pottery in southern Britain compared with those of for Extended Data Fig. 4. 


Extended Data Table 1| Summary of radiocarbon dates of lipids preserved in pottery vessels 


Site Location Potsherd # Description ce Laboratory# Cis: age Cis. age Combined _ Reference 
(ug/e) age 
Sweet Track Somerset levels, SW1 Carinated Bowl, refitted sherd 13,806 BRAMS-1520 5,105+33 5,114+432 5,110+25 9,10,25 
England sw2 Carinated Bowl, refitted sherd 4,900 BRAMS-1521 _5,089+38 5,094+32 5,092+26 
Catalhéyiik East = Konya, Turkey TP.M17 Holemouth jar, single sherd 393 BRAMS-1654 7,338442 7,416+39 7,382431 11,27 
‘TP Area’ TP.N10 Holemouth jar, refitted sherd 575 BRAMS-1699 =7,318429 7,378+30 7,348425 
TP.O23 Holemouth jar, refitted sherd 1,390 BRAMS-1546 7,290 + 36 7,375 + 32 7,340 + 27 
TP.P13 Holemouth jar, single sherd 362 _BRAMS-1703__7,328+36 7,394+29 _7,364425 
Rosheim Lower Alsace, ROS-C-4596 Coarse Kumpf, single sherd 973 BRAMS-1526 5,810 + 30 5,798 + 30 5,804 + 25 12 
‘Sandgrube’ France ROS-C-4600 Coarse Kumpf, refitted sherd 4,163 BRAMS-1527 5,897 + 36 5,909 + 35 5,904 + 28 
ROS-C-4644 Fine Kumpf, single sherd 6,064 BRAMS-1525 5,937 + 33 5,926 + 30 5,931 +4 26 
ROS-C-4657 Coarse Kumpf, single sherd 1,914 _BRAMS-1524 _5,885+37 _5,934+34 _5,912+28 
Ensisheim Upper Alsace, ENS-C-5913 Coarse Kumpf, single sherd 1,177. BRAMS-1915 6,345 + 31 6,303 + 31 6,324 + 26 12,13 
‘Ratfeld’ France ENS-C-5915 Coarse Kumpf, single sherd 771 BRAMS-1916 6383432 6314433 6,348+426 
ENS-C-5934 Coarse Kumpf, single sherd 1,645 BRAMS-1958 6,282+30 6,258430 6,270+25 
ENS-C-5940 Coarse Kumpf, single sherd 2,082 | BRAMS-2031 6,162 + 33 6,239 + 30 6,206 + 26 
Cuiry-lés- Aisne, CUI-C-5708 Coarse Kumpf, single sherd 881 BRAMS-1917 6,252+34 6,218436 6,236+27 12,13 
Chaudardes France CUI-C-5801 Coarse Kumpf, single sherd 9,886 BRAMS-2021 6,138 + 30 6,134 + 30 6,136 + 25 
Kénigshoven 14 Rhineland, KON-C-5594 Coarse Kumpf, single sherd 531 BRAMS-2029 6,253 + 29 6,298 + 29 6,276 + 24 12,13 
Germany KON-C-5598 Coarse Kumpf, single sherd 1,023 BRAMS-2026 6,106 + 34 6,139 + 34 6,123 + 27 
Geleen- Graetheide, GEL-C-3298 Coarse Kumpf, single sherd 577 BRAMS-2032 6,188+31 6,253+29 6,224425 12,13 
Janskamperveld — The Netherlands 
Karwowo 1 Pomerania, KAR-C-3636 Coarse Kumpf, single sherd 3,316 BRAMS-2028 6,176 + 30 6,230 + 30 6,204 + 25 12,13 
Poland KAR-C-3677 Coarse Kumpf, single sherd 1,900 BRAMS-2025 6,255 + 30 6,214 + 32 6,236 + 26 
Ludwinowo 7 Kuyavia, Poland LDW-C-2267 Coarse Kumpf, single sherd 323 BRAMS-2024 6,173+36  6,179+30 6,177+26 12, 13 
Takarkori Acacus TAK 21 Decorated, single sherd 9,503 BRAMS-1522 5,362 + 33 5,331 + 32 5,348 + 24 14, 28, 29 
Rockshelter mountains, TAK1572 Decorated, single sherd 3,558 BRAMS-1523 5,099+38  5,071+32 5,085 + 24 
Libya TAK 120 Decorated, single sherd 5,593 BRAMS-2608 6,008 + 35 5,949 +35 5,979 + 28 
TAK 420 Decorated, single sherd 1,119 BRAMS-2609 5,487+34 5,4984+35  5,493+28 
TAK 443 Decorated, single sherd 17,217 __BRAMS-2610 _6,021+35 _5,962+35 _5,992+ 28 
Principal Place London, England PPLO12 (<1814>) Plain Bowl, single sherd 713 BRAMS-2618 4,894+34 4928433 4,911+27 15 
PPLO15 (<1845>) Plain Bowl, refitted sherd 1,999 BRAMS-2479 4,708+33  4,771430 4,742+22 
PPLO20 (<1850>) Plain Bowl, refitted sherd 3,660 BRAMS-2483 4,628+40  4,670+34  4,652+426 
PPLO21 (<1819>) Plain cup, single sherd 2,985 BRAMS-2485 _ 4,732 +32 —_4,734+30 4,733 + 22 


Vessel descriptions, lipid concentrations and conventional radiocarbon ages (as defined previously® and calculated according to previously published methods”) of Ci¢,9 and Cig. fatty acids 


(which passed the internal quality control) extracted from pottery vessels. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size The vessels dated were selected based on their lipid concentration which determined the number of samples radiocarbon dated in this paper 


Data exclusions | Radiocarbon dates that did not pass the internal criterion (chi-square test on C16 and C18 fatty acids radiocarbon ages) explained in the SI 
document were excluded from chronological modelingr 


Replication o replication of radiocarbon dates was undertaken due to the destructive nature of the radiocarbon dating technique, and limited sample 
availability. Tests of repeatability have been presented in a previously published paper Casanova et al. (2018) cited in this paper. 


Randomization Randomization was not relevant to the study. Our aim was to check the accuracy of radiocarbon dates determined for pottery vessels from 
secure and well defined archaeological contexts. 
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The taste of sugar is one of the most basic sensory percepts for humans and other 
animals. Animals can develop a strong preference for sugar even if they lack sweet taste 
receptors, indicating a mechanism independent of taste’ >. Here we examined the 


neural basis for sugar preference and demonstrate that a population of neurons inthe 
vagal ganglia and brainstem are activated via the gut-brain axis to create preference for 
sugar. These neurons are stimulated in response to sugar but not artificial sweeteners, 
and are activated by direct delivery of sugar to the gut. Using functional imaging we 
monitored activity of the gut-brain axis, and identified the vagal neurons activated by 
intestinal delivery of glucose. Next, we engineered mice in which synaptic activity in 
this gut-to-brain circuit was genetically silenced, and prevented the development of 
behavioural preference for sugar. Moreover, we show that co-opting this circuit by 
chemogenetic activation can create preferences to otherwise less-preferred stimuli. 
Together, these findings reveal a gut-to-brain post-ingestive sugar-sensing pathway 
critical for the development of sugar preference. In addition, they explain the neural 
basis for differences in the behavioural effects of sweeteners versus sugar, and uncover 
an essential circuit underlying the highly appetitive effects of sugar. 


Sugar is a fundamental source of energy for all animals, and corre- 
spondingly, most species have evolved dedicated brain circuits to seek, 
recognize and motivate its consumption‘. In humans, the recruit- 
ment of these circuits for reward and pleasure—rather than nutritional 
needs—is thought to be animportant contributor to the overconsump- 
tion of sugar and the concomitant increase in obesity rates. Inthe 1800s 
the average American consumed less than 4.5 kg of sugar per year’; 
today, following the broad availability of refined sugar in consumer 
products, the average consumption is more than 45 kg per year®. 

Sweet compounds are detected by specific taste receptor cells 
on the tongue and palate epithelium’”*. Activation of sweet taste 
receptor cells sends hardwired signals to the brain to elicit recogni- 
tion of sweet-tasting compounds”””. We and others have studied the 
circuits linking activation of sweet taste receptors on the tongue to 
sweet-evoked attraction®”™”. Surprisingly, even in the absence of a 
functional sweet-taste pathway, animals can still acquire a preference 
for sugar!””. Furthermore, although artificial sweeteners activate the 
same sweet taste receptor as sugars, and they may do so with vastly 
higher affinities’, they fail to substitute for sugar in generating a behav- 
ioural preference”. 

Together, these results suggested the existence of a sugar-specific, 
rather than sweet-taste-specific pathway, that operates independently 
of the sense of taste to create preference for sugar and motivate con- 
sumption?™. Here, we dissect the neural basis for sugar preference. 


Sweet versus sugar preference 


When non-thirsty, wild-type mice are given a choice between water 
and sugar they drink almost exclusively from the sugar solution’. If, 
however, they are allowed to choose between an artificial sweetener 
(for example, acesulfame K (AceK)) and sugar, using concentrations 
at which both are equally attractive, naive mice initially drink from 
both bottles at a similar rate (Fig. 1a). However, within 24 h of expo- 
sure to both choices, their preference is markedly altered, such that 
by 48 h, they drink almost exclusively from the bottle containing sugar 
(Fig. 1a, b, compare 15 h with 48 h). This behavioural switch also hap- 
pens in knockout (KO) mice lacking sweet taste (7rpm5“ (hereafter 
TRPMS KO)*"" or TasiIr2‘ Tasir3 (hereafter TIR2/3 KO)’; Fig. Ic). 
Similar observations have been made in several studies, primarily using 
flavour-conditioning assays’”. Thus, although taste-knockout mice 
cannot taste sugar or sweetener, they learn to recognize and choose 
the sugar, most probably as a result of strong positive post-ingestive 
effects”. 

Notably, the preference for sugar does not rely on its caloric 
content’®. For example, if sugar is substituted for the non-metabolizable 
glucose analogue (methyl-a-D-glucopyranoside (MDG))” mice still 
develop a strong preference for MDG, just as they do for glucose (Fig. 1b; 
Extended Data Fig. 1). Thus, the signalling system recognizes the sugar 
molecule itself rather than its caloric content or metabolic products. 


"Zuckerman Mind Brain Behavior Institute, Howard Hughes Medical Institute and Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA. 7Department 
of Biological Sciences, Columbia University, New York, NY, USA. “Department of Neuroscience, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA. ‘These 


authors contributed equally: Hwei-Ee Tan, Alexander C. Sisti. “e-mail: cz2195@columbia.edu 
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Fig. 1| Sugar activates the gut-brain axis. a, Mice were allowed to choose 
between 600 mM glucose and 30 mM AceK (sweetener). Preference was tracked 
by electronic lick counters in each port. Bars at the top show lick rasters for 
glucose (red) versus AceK (blue) from the first and last 2,000 licks of the 
behavioural test. Note that by 48 h the animals drink almost exclusively from the 
sugar bottle. b, Preference plots for sugar versus AceK (n=9 mice, two-tailed 
paired t-test, P=2.39 x 10°) and MDG versus AceK (n=5 mice, two-tailed paired 
t-test, P=0.0024; Extended Data Fig. 1). Note that mice may begin the 
behavioural preference test exhibiting no preference for sugar (preference 

index ~ 0.5), some preference for sugar (preference index > 0.5) or with an initial 
preference for the sweetener (preference index < 0.5). However, in all cases they 
switched (or substantially increased) their preference towards sugar. c, Mice 
lacking the sweet taste receptor (T1R2/3 KO)’ (n=5 mice, two-tailed paired t-test, 
P=0.0038), and mice lacking TRPMS5 (TRPMS5 KO)’ (n=7 mice, two-tailed paired 
t-test, P=0.0001) switched their preference to sugar even though they cannot 
taste it. d, Schematic of sugar stimulation of Fos induction. Strong Fos labelling is 
observed in neurons of the cNST (highlighted yellow). Scale bars, 100 pm. Similar 
results were obtained in multiple mice in each experiment (Extended Data Fig. 2). 
e, Quantification of Fos-positive neurons. The equivalent area of the cNST 
(bregma -— 7.5 mm) was processed and counted for the different stimuli. The 
signal present in water alone was subtracted before plotting; ANOVA with Tukey's 
honestly significant difference (HSD) post hoc test against AceK (n= 6 mice): 
P=4.68 x10™ (glucose, n=8 mice), P=0.001 (MDG, n=5 mice). Values are 

mean +s.e.m. 


Brain neurons activated by sugar 


For an animal to develop a preference for sugar over sweetener, it must 
recognize and distinguish between two innately attractive stimuli. 
We reasoned that if we could identify a population of neurons that 
respond selectively to the consumption of sugar, it may provide an 
entry to reveal the neural control of sugar preference and the basis of 
sugar craving. 

We exposed separate cohorts of mice to sugar, sweetener or water, 
and examined their brains for induction of Fos as a proxy for neural 
activity”? (see Methods). Our results showed prominent bilateral 
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labelling in the caudal nucleus of the solitary tract (cNST; Fig. 1d), an 
area known to function as a nexus of interoceptive signals conveying 
information from the body to the brain”. By contrast, the cNST was 
not substantially labelled in response to sweetener or water controls 
(Fig. 1d, Extended Data Fig. 2a). Furthermore, if these cNST neurons are 
involved in sugar-preference behaviour, they must also be activated by 
MDG (Fig. 1d, e), and their activation by sugar should be independent 
of the taste system (Extended Data Fig. 3a). 

How do sugar signals reach the cNST? The finding that prefer- 
ence for sugar does not require the taste system strongly suggested 
post-ingestive recognition. Therefore, we tested whether intragas- 
tric application of sugar was sufficient to activate the cNST. As pre- 
dicted, direct gut infusion of sugar (but not sweetener) is sufficient to 
activate the cNST as robustly as oral ingestion (Extended Data Fig. 3b). 
These results also substantiate previous behavioural studies showing 
that intragastric infusion of glucose is sufficient to condition flavour 
preference”””?. 


The gut-brain axis 


Anumber of recent studies have implicated the gut-brain axis as a key 
mechanism for transmitting information from the gut to the brain via 
the vagus nerve***°, The gut-brain axis is emerging as a fundamental 
conduit for the transfer of neural signals informing the brain of the 
metabolic and physiologic state of the body. Ifinformation about sugar 
detection is being transferred from the gut to the cNST via the gut-brain 
axis, then it should be possible to directly monitor the activity of this 
circuit by using real-time recordings of cNST activation in response 
to synchronized gut stimulation with sugar. Furthermore, this activ- 
ity should be abolished following transection of the vagal nerve, and 
notably, silencing vagal sensory neurons should prevent the creation 
of sugar preference. 

We used fibre photometry” to record sugar-evoked responses in the 
cNST of mice expressing the genetically encoded calcium indicator 
GCaMPé6s in excitatory neurons (Vglut2-cre;Ai96; Vglut2is also known 
as Slc17a6). To deliver stimuli to the gut, we placed a catheter directly 
into the duodenal bulb and created an exit port by transecting the 
intestine about 12 cm distally (Fig. 2a, see Methods). As predicted, 
our results (Fig. 2b—d) showed robust responses to glucose and MDG. 
Most notably, all activity was abolished after bilateral transection of 
the vagal nerves (Fig. 2b-e). 

Next, we examined whether cNST neurons activated in response to 
sugar ingestion indeed receive direct input from vagal ganglion neurons 
(that is, from the nodose ganglia; see Extended Data Fig. 4a, b). Totest 
this, we used the targeted recombination in active populations (TRAP) 
system?>” to target Cre recombinase to sugar-activated cNST neurons, 
and used a Cre-dependent monosynaptic retrograde viral reporter to 
identify their synaptically connected input neurons?°”), 

We infected the cNST with adeno-associated virus (AAV) carrying a 
Cre-dependent glycoprotein coat and a surface receptor for a trans- 
synaptic reporter virus*°*!, and TRAPed sugar-activated neurons 
(Fig. 3a; see Extended Data Fig. 4c, d for selectivity of TRAPing). Next, 
we infected the TRAPed neurons with the retrograde rabies reporter 
(RABV-dsRed), and investigated whether sugar-activated cNST neurons 
receive input from vagal ganglion neurons. As controls, we carried out 
similar experiments but used water or sweetener as TRAPing stimuli. 
Our results (Fig. 3b, c) revealed large numbers of nodose neurons 
labelled by the transsynaptic tracing strategy, demonstrating that 
the sugar-activated cNST neurons receive direct monosynaptic input 
from the vagal ganglion. By contrast, when we used AceK or water for 
TRAPing, only asmall number of vagal neurons were labelled; we believe 
these represent activation to licking/drinking or ingestion (Fig. 3b, c). 

Finally, we carried out a genetic vagotomy by globally silencing 
nodose sensory neurons (see Methods), which—as predicted—pre- 
vented the development of sugar preference (Extended Data Fig. 3c, d). 
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Fig. 2 | Silencing the sugar-activated circuit abolishes sugar preference. a, Fibre 
photometry monitoring glucose-evoked responses of cNST neurons. The 
excitatory neurons in the cNST were targeted with GCaMP6s using VG/ut2-cre 
mice**. b-d, Neural responses following intestinal delivery of glucose, AceK or 
MDG. Note strong responses to sugar (b) and MDG (d). The light traces denote 
normalized two-trial averages from individual animals and the dark trace is the 
average of all trials. Black bars below traces indicate the time and duration of 
stimuli. The average responses after bilateral vagotomy are shownin red 

(see Methods). Stimuli: 500 mM glucose, 30 mM AceK or 500 mM MDG; n=4 mice. 
NR, normalized response. e, Quantification of neural responses before and after 
vagotomy. Two-tailed paired t-test, P=3 x 10° (glucose), P=5 x 10° (MDG), n=4 
mice. Dataare mean +s.e.m. f, Schematic of silencing strategy. TRAP2 mice” were 
stimulated with 600 mM glucose to induce expression of Cre recombinase in the 
CNST. AAV-DIO-TetTox” was then targeted bilaterally to the cNST for silencing. g, 
Silencing the sugar-preference neurons in the cNST does not impair the innate 
attraction to sugar or sweeteners. The graph shows preference for 600 mM glucose 
versus water, and preference for 30 mM AceK versus water. n= 6 mice. Data are 
mean +s.e.m.h, Sugar-preference graphs (48-h tests) for wild-type mice, 
demonstrating the robust development of preference for sugar versus sweetener 
(see also Fig. 1). By contrast, silencing sugar-activated neurons inthe cNST 
abolishes the development of sugar preference. n= 6 mice; two-sided Mann- 
Whitney U-test, P=4 x 10 *; TetTox-silenced animals consumed as much of the AceK 
sweetener as they did sugar (see also Extended Data Fig. 5). Dataare mean+s.e.m. 


Neurons in the cNST mediate sugar preference 


Ifthe gut-to-brain sugar-activated cNST neurons are essential for creat- 
ing preference for sugar, then blocking their function should prevent 
the formation of sugar preference. Therefore, we engineered mice 
in which synaptic transmission in the sugar-preference neurons was 
genetically silenced by targeted expression of tetanus toxin light chain 
(TetTox)*. Our strategy relied onthe TRAP system”®”’ to express induc- 
ible Cre recombinase in sugar-activated cNST neurons, and bilaterally 
injecting the cNST with an AAV carrying the Cre-dependent TetTox 
construct (Fig. 2f, see Methods). 

First, we needed to ensure that silencing this circuit did not affect the 
innate ability of the animals to be attracted to sweet taste, including sugar 
and sweeteners. Indeed, when the TetTox-targeted mice were tested to 
choose between sweet or water, they selected the sweet-tasting solutions 
(either AceK or glucose; Fig. 2g). However, silencing the sugar-activated 
cNST neurons abolished their capacity to develop a preference for sugar 
over artificial sweetener, even after prolonged testing sessions (Fig. 2h, 
Extended Data Fig. 5). These results illustrate the essential role of this 
circuit in driving the behavioural preference for sugar. 


Vagal neurons sensing gut sugar 


As information about sugar detection is being transferred via the 
gut-vagal-brain axis, we set out to directly monitor the activity of 
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Fig. 3| Vagal ganglion neurons transmit sugar signals to the brain. a, Strategy 
for targeting a red fluorescently labelled retrograde transsynaptic rabies 
reporter (RABV-dsRed)**” to the cNST. Sugar-TRAP neurons in the cNST 
(designated as ‘starter cells’)>°*! were infected with AAVs encoding proteins 
required for infection with RABV-dsRed, resulting in labelling of the 
monosynaptic inputs of the sugar-activated cNST neurons by the retrogradely 
transsynaptic transfer of the RABV-dsRed virus. b, Quantification of 
retrogradely labelled RABV-dsRed neurons in the nodose ganglion. Sugar versus 
AceK TRAP labelling (n =3 mice). ANOVA, Tukey’s HSD post hoc test, P= 0.0449. 
We also performed control TRAP labelling with water (n =2 animals). Sugar 
versus water TRAP: ANOVA, Tukey’s HSD post hoc test, P= 0.0407; AceK versus 
water TRAP: P= 0.9, Data are mean +s.e.m. c, Sugar-TRAP CNST neurons (starter, 
green) receive monosynaptic input from vagal neurons (RABV, red). Note the 
absence of starter cells in the nodose, confirming that the RABV (red) cells 
represent retrogradely labelled neurons”. Scale bars, 100 um. 


this circuit by imaging vagal-neuron responses to gut stimulation 
with sugar. 

We implemented a vagal ganglion functional imaging platform 
(Fig. 4a) by targeting the genetically encoded calcium indicator 
GCaMP*® to vagal sensory neurons* (Vglut2-cre;Ai96). To visualize 
the neurons and measure calcium dynamics, we exposed a1-cm ventral 
window into the ganglion and used a one-photon microscope equipped 
with an electron-multiplying CCD camera for imaging®. For most imag- 
ing sessions, the intestinal segment was exposed to a pre-stimulus 
application of PBS, then a 10-s (33 pl) or 60-s (200 pl) test stimulus, 
and a3-min post-stimulus wash. Neuronal signals were analysed for 
statistically significant increases in intracellular calcium over baseline 
(see Methods), and aneuron was classified as a responder if it exhibited 
responses in more than 60% of the trials*®. 
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Fig. 4| Imaging the gut-brain axis. a, We imaged calcium responses in vagal 
sensory neurons expressing the fluorescent calcium indicator GCaMPé6s while 
stimulating the intestines. b, Heat maps depicting z-score-normalized 
fluorescence traces*” from vagal neurons identified as glucose responders. 
Each rowrepresents the average activity of a single cell to three trials. Stimulus 
windowis shown by dashed white lines. Left, responses of n=206 vagal 
neurons toa 60-s intestinal infusion of 500 mM glucose; note lack of responses 
to30 mM AceK. Right, heat maps depicting n= 133 vagal neurons that 
responded to 60-s infusion of 500 mM glucose, and tested for their responses 
to500 mM MDG. Heat maps were normalized across stimuli; responses to 
glucose and MDG were similar (two-tailed paired t-test, P=0.06).c, Sample 
traces of vagal-neuron responses to intestinal stimulation with 60-s pulses of 
30 mM AceK and 500 mM glucose from 3 mice (top), or to10-s pulses of 500 mM 
glucose and 500 mM MDG (middle and bottom). Note the reliability and rapid 
onset of responses to the 10-s stimulus (Extended Data Fig. 6c). When usinga 
10-s stimulus, to minimize potential osmolarity responses (Extended Data 


First, we examined how vagal neurons respond to intestinal delivery 
of glucose versus sweetener. Delivering glucose into the intestines 
elicited significant calcium responses in subsets of ganglion neurons 
(Fig. 4b); we analysed the responses from the vagal ganglia of 8 different 
mice to a60-s stimulus of glucose or AceK, and identified around 200 
neurons that displayed statistically significant responses to glucose, 
but less than 1% of these neurons displayed stimulus-dependent activity 
to AceK (Fig. 4b). As expected, intestinal delivery of MDG also activated 
the majority of vagal neurons that responded to glucose (Fig. 4b, c, 
Extended Data Fig. 6a). 

Next, we assessed the reliability and temporal causality of the 
vagal responses by reducing the stimulus window from 60 sto10s. 
Our results showed that vagal responses to intestinal glucose are 
robust and reliable” (Fig. 4c, Extended Data Fig. 6b, c). Overall, we 
examined 51 ganglia and 4-5% of GCaMP-expressing neurons (205 out 
of 4,803 neurons) responded to the 10-s glucose stimulus (Extended 
Data Fig. 6d). 

As neurons in the nodose ganglion innervate the gut” (that is, the 
source of the gut-brain signal), the cell bodies of the sugar-sensing 
neurons in the nodose ganglion should be retrogradely labelled by 
applying a tracer from their afferents in the gut”. Thus, we injected 
fluorescently conjugated cholera toxin subunit B (CTB)’’ into the 
duodenum of GCaMP-expressing mice (Extended Data Fig. 7a), and 
examined the labelled duodenal innervating neurons for responses 
to intestinal delivery of sugar. Indeed, around 20% of the duodenum 
back-filled vagal sensory neurons robustly responded to glucose 
(Extended Data Fig. 7b, c). 

We note that a previous study reported the characterization of 
candidate nutrient-sensing neurons in the nodose ganglia. These 
neurons responded indiscriminately to high concentrations of several 
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Fig. 8), approximately 5% of imaged neurons showstatistically significant 
responses to glucose (Extended Data Fig. 6d). We compared imaging sessions 
with both the right and left ganglia” and did not observe any meaningful 
difference in the proportion of glucose-responding neurons (Extended Data 
Fig. 6e).d, Vagal-neuron responses to 3-OMG (top) and galactose (bottom), 
n=3 independent experiments each. These agonists activate vagal neuronsina 
similar manner to glucose (Extended Data Figs. 2b, 10a, b).e, Heat maps of 46 
glucose-responding neurons to 500 mM fructose and 500 mM mannose (n=5 
ganglia). The monosaccharides fructose and mannose, whichare not 
substrates for SGLT1, do not activate glucose-responsive neurons. Fewer than 
10% of glucose responders were activated by either fructose or mannose. f, g, 
Summary of responses toa 10-s stimulus of 500 mM glucose for 33 neurons 
before and after intestinal application 8 mM phlorizin for 5 min (n=4 mice). 
Responses are severely diminished after blocker application (see Extended 
Data Fig. 10d, eand Methods). 


stimuli, including 1M glucose and 0.5 M salt. Our results show that such 
responses, which are largely independent of the quality of the stimulus, 
are not glucose-sensing nor are they required for the development of 
sugar preference, but rather represent responses to a wide range of 
high-osmolarity stimuli (Extended Data Figs. 8, 9). 


SGLT1 transduces gut-brain sugar signals 


We reasoned that the gut-to-brain signal might depend on known sugar 
sensors recruited into this role, perhapsina dedicated subpopulation 
of gut cells. Although the sweet-taste receptor is expressed in enteroen- 
docrine cells”, it is not involved in this process, as sweet-taste receptor 
knockout (T1R2/3 KO) mice still exhibited normal sugar-preference 
behaviour (Fig. Ic). 

The principal glucose transporter (and sensor) in the intestine is the 
sodium-glucose-linked transporter-1 (SGLT1)®. This transporter is 
expressed in enterocytes as well as in enteroendocrine cells that secrete 
a wide range of hormones and bioactive molecules and are thought 
to also function as a conduit between the gut and the vagal nerve*”. 
Therefore, we investigated whether SGLT1 is required to transmit the 
gut-to-brain sugar signal by determining whether other substrates 
of SGLT1—galactose and the glucose analogue 3-O-methyl-D-glucose 
(3-OMG)”*—also activate the same vagal neurons as glucose. Indeed, 
neurons responding to intestinal glucose were also stimulated by 
3-OMG and galactose (Fig. 4d, Extended Data Fig. 10a, b). Critically, this 
circuit is dedicated to glucose, as other caloric sugars such as fructose 
and mannose (that are not substrates of SGLT1)” do not activate the 
glucose-responsive vagal neurons (Fig. 4e, Extended Data Fig. 10c), do 
not create a behavioural preference (Extended Data Fig. 8e), but still 
trigger osmolarity responses (Extended Data Fig. 8). 


Next, we assessed whether pharmacological inhibition of SGLT1 
abolishes the glucose-dependent neuronal responses. We examined 
the responses to two consecutive 10-s stimuli of intestinal glucose 
before and after a 5-min wash of the intestinal segment with phlorizin”, 
an SGLT1 blocker. Our results (Fig. 4f, g, Extended Data Fig. 10d, e), 
demonstrated a marked loss of glucose responses following intesti- 
nal application of phlorizin”. Together, these results place SGLT1 as 
an important component of the sugar-preference signalling circuit. 
It will be of interest to determine the identity of the intestinal cells 
mediating these responses, as they represent another potential target 
for modulating this circuit. 


Co-opting the sugar-preference circuit 


The results presented above reveal a specific circuit via the vagal gan- 
glia to the brain critical for driving the development of preference for 
sugar. We devised an experiment to determine whether the selective 
activation of this circuit can be recruited to create a preference toapre- 
viously less-preferred stimulus. Our strategy was to identify a genetic 
driver that marks sugar-preference neurons inthe cNST, and then link 
their activation to the ingestion ofa novel stimulus. 

We examined the Allen Brain Atlas for candidate genes with enriched 
expression in the cNST, and tested candidates for glucose-evoked 
Fos labelling (Fig. 5a). Our results demonstrated that proenkepha- 
lin (Penk)-expressing neurons in the cNST®, marked by a Penk-cre 
construct driving td Tomato (Penk-cre;Ai75D), respond strongly to sugar 
stimuli (Fig. 5b, c); approximately 85% of the sugar-induced Fos-labelled 
neurons inthe cNST are Penk-positive, and over 80% of the Penk-positive 
neurons were labelled by Fos after sugar ingestion. 

We injected a Cre-dependent AAV encoding the excitatory 
designer receptor hM3Dq“* into the cNST of Penk-cre mice, so that 
Penk cNST neurons could be experimentally activated by the hM3Dq 
agonists clozapine N-oxide or clozapine***. After 8 days to allow for 
receptor expression, mice were exposed to two-bottle preference assays 
using artificially sweetened cherry-flavoured versus grape-flavoured 
solutions (Fig. 5d). Under this paradigm, the cherry solution was 
made sweeter than grape (see Methods) so that the animals would be 
significantly more attracted to the cherry flavour (Fig. 5e). Next, we 
introduced clozapine into the less-preferred grape flavour, and investi- 
gated whether clozapine-mediated activation of the Penk cNST neurons 
(much like glucose-mediated activation) can create a new preference. 
Indeed, after 48 h of exposure to the grape plus clozapine bottle, mice 
completely switched their preference, even though the grape solution 
was far less sweet than the cherry solution (Fig. Se, purple lines). To 
demonstrate that the preference switch is independent of the nature 
of the initially less-preferred stimuli, we flipped the starting flavours 
so that cherry was less favoured, and obtained an equivalent switchin 
preference (Fig. Se, red lines). As anticipated, wild-type mice without 
the designer receptor were indifferent to clozapine and continued to 
prefer the sweeter solution (Fig. 5f). These results demonstrate that 
artificial activation of the sugar-preference circuit is sufficient to drive 
the development of anovel preference to an otherwise low-preference 
stimulus. 


Discussion 
Sugar is an essential energy source across all animal species, and it is 
therefore expected that selective circuits be dedicated to seek, recog- 
nize and motivate its consumption. The discovery of this gut-to-brain 
circuit provides a powerful pathway to help to meet these needs. 
Inthis study, we show that glucose acts in the gut to activate a neural 
circuit that communicates to the brain the presence of sugar. What is 
the advantage of a gut-to-brain sugar-detection system in addition to 
the taste system? A post-ingestive sensing system in the gut assures 
that signalling only occurs after the sugar molecules reach their desired 
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Fig. 5 | Activation of sugar-responsive cNST neurons confers novel flavour 
preference. a—c, Penk-cre mice were stimulated with 600 mM glucose and brain 
slices were analysed for Fos and Penk labelling. Penk neurons were marked by 
expression of nuclear-localized tdTomato (Ai75D reporter line)*®. b, Low- 
magnification section of the brain stem (bregma -— 7.5 mm) showing Penk 
expression (red); tissue was counterstained with DAPI (blue). n = 2 independent 
experiments. Scale bar, 500 tm; cNST, yellow box. c, Sugar-preference neurons 
express Penk. Penk neurons labelled with tdTomato (from b) and 
glucose-activated neurons (Fos-labelled) marked green. Note the high degree of 
overlap in the merged image. Approximately 85% of sugar-activated cNST 
neurons are marked by Penk, and about 90% of cNST Penk neurons show 
sugar-Fos labelling (n =3 mice). Scale bar, 20 jum. d, Expression of activating 
DREADD receptor*** (via AAV-DIO-hM3Dq) was targeted bilaterally to the CNST 
of Penk-cre mice. The mice were then tested for their preference between two 
flavours for 48 h (Pre). Shown is an example using cherry (containing 2 mM AceK) 
versus grape (with 1 mM AceK). Mice were conditioned and tested using the 
less-preferred flavour plus the DREADD agonist clozapine (Post; see Methods). 
e, Penk-hM3Dq mice initially prefer the sweeter solution. After associating 
clozapine-mediated activation of Penk cNST neurons with the less-preferred 
flavour, all the Penk-hM3Dq mice switched their preference (Pre, 18.1+ 2.7%; 
Post, 61.1+ 5.5%; n=8 mice; two-sided Mann-Whitney U-test, P=1x 10). The 
experiment was carried out using grape (purple lines) or cherry (red lines) as the 
initially less-preferred stimuli. f, Mice not expressing the DREADD receptor are 
unaffected by the presence of clozapine (Pre, 19.0 + 3.0%; Post, 21.4+4.0%;n=8 
mice); control mice were subjected to the same conditioning and testing as the 
experimental cohort. Values are mean+s.e.m. 


target for effective absorption and metabolic consumption. The asso- 
ciation between the activation of this gut-to-brain circuit paired with 
the recognition of sugar by the taste system affords animals the funda- 
mental capacity to identify, develop and reinforce a strong and durable 
preference for sugar-rich food sources. The evolutionary association 
of these two separate circuits combines nutrition with the basic sense 
of taste. In the future, it would be of interest to determine whether 
preference for other essential nutrients also utilizes this gut—brain axis. 

Notably, artificial sweeteners were introduced in consumer products 
more than four decades ago; however, their overall impact in decreas- 
ing sugar consumption, preference and craving has been negligible. 
This may now be understood at the circuit level (that is, as—in contrast 
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to sugar—they do not activate the preference circuit), and implies that 
it may be possible to develop a new class of sweeteners that activate 
both the sweet-taste receptor in the tongue and the gut-brain axis, 
and consequently help to moderate the strong drive to consume sugar. 
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Methods 


Mice 

All procedures were carried out in accordance with the US National 
Institutes of Health (NIH) guidelines for the care and use of laboratory 
animals, and were approved by the Institutional Animal Care and Use 
Committee at Columbia University. Adult animals older than 6 weeks 
of age and from both genders were used in all experiments. C57BL/6J 
(JAX 000664), Arc-creER (TRAP, JAX 021881), TRAP2 (JAX 030323), 
TRPMS KO (JAX 013068), T1R2/3 KO (generated in house, JAX 013065 
and 013066), Ai96 (JAX 028866), VGlut2-IRES-Cre (JAX 028863), 
Gpr65-IRES-Cre (JAX 029282), Penk-IRES2-Cre (JAX 025112), Ai75D 
(JAX 025106) and R26-TeNT (MGI 3839913). 


Fos stimulation and immunohistochemistry 

Animals were water restricted for 23 h, given access to 1 ml of water 
for Lh, and then water restricted again for another 23 h. The stimulus 
consisted of 600 mM glucose, 600 mM MDG, 600 mM sucrose, 600 mM 
3-OMG, 600 mM galactose, 30 mM AceK or milliQ water for a period of 
90 mininthe absence of food. For intra-gastric infusion experiments, 
food was removed from the cage 12 h before stimulus infusion. A 
syringe pump microcontroller (Harvard Apparatus) was used to deliver 
1.5mlofthe stimulus solution at 0.075 ml min“. After 90 min, mice were 
perfused transcardially with PBS followed by 4% paraformaldehyde. 
Brains were dissected, and fixed overnight in paraformaldehyde at 
4 °C. The brains were sectioned coronally at 100 xm, and labelled with 
anti-c-Fos (Santa Cruz, sc-52 goat, 1:500; or SYSY, no. 226004 guinea 
pig, 1:5,000) in 5% normal donkey serum (EMD Millipore, Jackson Immu- 
noResearch) in 0.3% Triton X-100 in 1x PBS for 48 h at 4 °C with gentle 
shaking, and then Alexa Fluor 488-, 568- or 647-conjugated donkey 
anti-goat or anti-guinea pig (Jackson ImmunoResearch) in 5% normal 
donkey serum in 0.3% Triton X-100 in 1x PBS for 24 hat 4 °C with gentle 
shaking. Images were acquired using an Olympus FluoView 1000 con- 
focal microscope. Larger field-of-view images were acquired using a 
Nikon AZ100 Multizoom Slide Scanner. Quantification of Fos-labelled 
neurons was done by manual counting in a 300 x 300 um region of 
interest (ROI) inthe right cNST. 

For intragastric stimulation, animals were anaesthetized with keta- 
mine and xylazine (100 mg kg and 10 mg kg“, intraperitoneal). The 
stomach was exteriorized through an abdominal incision, and a Silastic 
(Dow Corning) tubing was inserted into the forestomach region and 
secured with silk sutures’. The other end was tunnelled subcutaneously 
along the left flank and exteriorized at the dorsal neck area. Mice were 
individually housed and allowed to recover for at least 5 days before 
stimulus infusion. 


Two-bottle preference assays 

The preference-switch experiments were carried out in standard mouse 
cages holding a custom designed 3D-printed scaffold for two bottles. 
Each bottle was outfitted with an electronic licking sensor, and access 
to the licking spout was controlled by a mechanical shutter. Mice were 
not water deprived before the experiment and had ad libitum access 
to food throughout. For behavioural tests, mice were first tested for 
their initial preference by completing 100 drinking trials. Each trial 
consisted of a choice between 600 mM glucose (or 600 mM MDG) and 
30 mM AceK. Trials lasted 5 s and were initiated after the first lick to 
either bottle; inter-trial intervals were 40s. To familiarize animals with 
the two choices, mice were required to complete 500 licks to 600 mM 
glucose alone, and 30 mM AceK alone; this was repeated twice. Animals 
were tested for their sugar (or MDG) versus sweetener preference over 
36 husing 5-s trials. Preference indexes: Pre, the number of licks to 
glucose divided by the total number of licks during the first 100 trials 
of baseline measurements; Post, the number of licks to glucose divided 
by the total number of licks during the last 100 trials of the behavioural 
session. Because T1R2/3 double knockouts cannot taste sweet, they 


are often averse to the ‘bitter’ in high concentrations of AceK (that is, 
not being countered by its high sweetness), therefore they were tested 
with 300 mM sucrose versus 5 mM AceK. 


Molecular cloning of custom pAAV constructs 
pAAV.hSyn.FLEX-eGFP-Rpl10a.WPRE.hGH-pA is constructed by ligation 
of two fragments: the eGFP pAAV backbone fragment was generated 
by digestion of pAAV-FLEX-EGFPL10a, a gift from N. Heintz, A. Nectow 
and E. Schmidt (Addgene plasmid 98747), with Mlul and KpnI, and the 
hSyn fragment with corresponding restriction ends was generated 
from pAAV-hSyn-DIO-hM4D(Gi)-mCherry, a gift from B. Roth (Addgene 
plasmid 44362). 

pAAV.CBA.FLEX-GFP-TeTx.WPRE.bGH-pA (TetTox) DNA isa gift from 
P. Wolff. 

All pAAVs were amplified in recAl NEB Stable cells and extracted by 
maxiprep (Zymo Research), and serotype 2/9 AAVs were produced by 
the Janelia Farms viral core. 


Genetic access to sugar-preference neurons 

The TRAP”®”?* strategy was used in TRAP2 mice to gain genetic access to 
sugar-activated neurons inthe cNST. The 4-hydroxytamoxifen (4OHT, 
Sigma H6278) was prepared as previously described*. AAV-injected 
TRAP2 mice were singly housed, water restricted for 23 h, given access 
to1ml of water for 1h, water restricted again for another 23 h, andthen 
presented with 600 mM glucose (or 30 mM AcekK) ad libitum, in the 
absence of food and nesting material. After 1h, mice were injected intra- 
peritonially with 12.5 mg kg? 40OHT, and placed back to the same cage 
for an additional 3 h. At the end of the 4 h of sugar or AceK exposure, 
mice were returned to regular home-cage conditions (group-caged, 
with nesting material, ad libitum food and water). Mice were used for 
experiments a minimum of 7 days after this TRAP protocol. 


Stereotaxic surgery 

For stereotaxic injections of reporter virus, mice were anaesthetized 
with ketamine and xylazine (100 mg kg‘ and 10 mg kg“, intraperito- 
neal), and placed into a stereotaxic frame with a closed-loop heating 
system to maintain body temperature. For retrograde monosyn- 
aptic tracing, animals were unilaterally injected with 100 nl of a1:1 
mixture of AAVs carrying Cre-dependent rabies TVA and glycopro- 
tein G (AAV1 EFla-FLEX-TVA-mCherry, UNC vector core, and AAV1 
FLEX-nGFP-2A-N2c(G) (a gift from T. Reardon)”, and a pseudotyped 
rabies virus carrying dsRed (RABV N2C(Delta G)-dsRed-EnvA, a gift from 
T. Reardon)*. cNST coordinates (Paxinos stereotaxic coordinates”) 
used for injections are relative to bregma and skull surface: caudal 7.5 
mm, lateral +0.3 mm, ventral 3.7-4 mm. 


Monosynaptic retrograde tracing and silencing experiments 
For retrograde monosynaptic tracing, Arc-CreER (TRAP)”* mice were 
allowed to recover 3 weeks after AAV injection, and the TRAP procedure 
was carried out as described above, except that 40HT was prepared 
incorn oil’, and was injected 1h before stimulus presentation. After 7 
days, EnvA-RABV was injected into the same site. Mice were euthanized 
2 weeks after the RABV injection and examined for expression of starter 
cells (nGFP and dsRed) and their monosynaptic inputs (dsRed)*>". 
For synaptic inhibition experiments, sugar-TRAP cNST neurons were 
bilaterally injected with 300 nl of AAV carrying Cre-dependent TetTox 
(AAV9 CBA.FLEX-TetTox)™. 


Synaptic-silencing experiments 

C57BL/6J and Trap2”"™ mice expressing TetTox in the cNST were tested 
in the two-bottle sugar versus sweetener preference assay for 48 h. 
For the first day, mice were acclimatized by exposure to AceK versus 
water, the second they were given glucose versus water, and the third 
and fourth days they were tested for their preference to sugar versus 
sweetener. To ensure silencing did not affect sweet-taste detection, 
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mice were also examined for their attraction to sugar versus water 
(second day) as well as artificial sweetener versus water (first day). 
Fraction consumed for sugar versus AceK on days 3-4 were calculated 
as (volume of glucose consumed)/(total volume consumed). Fraction 
consumed for AceK versus water was calculated as (volume of AceK 
consumed)/(total volume consumed). 


Fibre photometry, gut stimulus delivery and vagotomy 
Vglut2-cre;Ai96 animals were placed in a stereotaxic frame and 
implanted with a 400 pm core, 0.48 NA optical fibre (Doric Lenses) 
50-100 um over the right cNST. Photometry experiments were con- 
ducted aminimum of 13 days after fibre implantation surgery. Real-time 
population-level GCaMP fluorescence was recorded using a RZS5P fibre 
photometry system with Synapse software (Tucker Davis Technolo- 
gies) as described previously”. In brief, sinusoidally modulated 465 
nm and 405 nm light from light-emitting diodes (Doric Lenses) were 
combined via a multi-port fluorescence mini-cube into a fibre patch- 
cord connected to the mouse, and real-time demodulated emission 
signals were saved offline for analysis. Calcium-dependent signals 
Fygsnm Were compared with calcium-independent GCaMP fluorescence 
F495nmto control for movement and bleaching artefacts. The data was 
de-trended by first applying a least-squares linear fit to produce 
Fricced 40snm» aNd AF/F was calculated as (Fy65nm ~ Fitted 405nm)/F fitted 405 nm 
(ref. 7’). Data from each mouse were then normalized to peak fluores- 
cence (calculated as a 10-s window around the peak point), and pre- 
sented as normalized responses. For each stimulus, the normalized 
two-trial average was plotted and smoothed over a moving average. To 
quantify effects of vagotomy, we calculated the ratio of stimulus-related 
peak amplitude of the normalized trace (within120 s of stimulus onset) 
before and after the procedure. 

To deliver intestinal stimuli, all animals were anaesthetized with 
ketamine and xylazine (100 mg kg‘ and 10 mg kg“, intraperitoneal), 
re-dosing was performed as necessary with ketamine only (33 mg kg’). 
Mice were immobilized as previously described”, positionedina supine 
position, with the head rigidly secured using the metal bar. To ensure 
aclear airway, the mouse was tracheotomized. An incision was made 
into the greater curvature of the stomach, the tip of the catheter was 
inserted past the pyloric sphincter and secured by a suture into the 
duodenal bulb. Another suture was tied around the catheter and stom- 
ach to prevent spillage of gastric contents. Upon implantation of the 
catheter, the intestines were filled with 1 ml of PBS and an exit port cut 
at the most distally inflated intestinal segment, approximately 12 cm 
from the catheter. The intestines were flushed with PBS for 5 min at 
150 pI min“ before the beginning of each experiment. Stimulus delivery 
was performed via a series of peristaltic pumps (BioChem Fluidics) 
operated via custom Matlab software/ Arduino microcontroller. Stimuli 
and washes were delivered through separate lines that converged ona 
common perfusion manifold (Warner Instruments) connected to the 
duodenal catheter. All trials were 7-min long and consisted of a120-s 
baseline (PBS 150 pl min”), a60-s stimulus (200 pl min”), anda4-min 
washout period (180 s at 600 pl min”, and 60 sat 150 pl min”). Stimuli 
were each presented twice in an interleaved fashion. All chemicals were 
obtained from Sigma and dissolved in 1x PBS at the following concentra- 
tions: 30 mM AceK, 500 mM glucose and 500 mM MDG. 

The vagotomy procedure was carried out immediately after the first 
round of stimuli. Salivary glands were cauterized and removed. Then, 
skin around the tracheotomy tube was retracted to expose the cervical 
trunk of the vagus nerve running in close proximity tothe carotid artery. 
The nerve was carefully dissected from the underlying vessels using 
fine Dumont forceps and fully transected by a pair of Vannas scissors™. 


Genetic vagal silencing experiments 

Vglut2-cre animals were anaesthetized with ketamine and xylazine 
(100 mg kg‘ and 10 mg kg", intraperitoneal). Ophthalmic ointment 
was applied to the eyes, and subcutaneous injections of carprofen 


(5mg kg”) and buprenorphine (0.05 mg kg“) were given to each mouse 
before surgery. The skin under neck was shaved and betadine and alco- 
hol were used to scrub the skin three times. A midline incision (-1.5 
cm) was made and the trachea and surrounding muscles were gently 
retracted to expose the nodose ganglia. Then, AAV9 CBA.FLEX-TetTox 
(600 nl per ganglion) containing Fast Green (Sigma, F7252-5G) was 
injected in both left and right ganglia using a 30° bevelled glass pipette 
and Nanolitre 2000 microinjector positioned with a micromanipulator. 
Virus was injected in 60-nl pulses and ganglion targeting was visualized 
with the dye. At the end of surgery, the skin incision was closed using 5-0 
absorbable sutures (CP medical, 421A). Mice were allowed to recover 
fora minimum of 26 days before behavioural testing. We note that 50% 
of the animals survived the surgical procedure and bilateral injections. 


Vagal calcium imaging 

Vglut2-cre;Ai96 or Gpr65-Cre;Ai96 mice were anaesthetized, tracheoto- 
mized, and positioned ona surgical platform (Thorlabs breadboard). 
The nodose ganglion was then exposed by severing the posterior 
tendon of the digastric muscle, cauterizing the occipital branch of the 
carotid artery and dissecting the trunk of the nerve. Then the prepara- 
tion was affixed to a set of manual goniometric stages (Newport Instru- 
ments) allowing for angular rotation about the longitudinal and lateral 
axes for optimal positioning under the microscope. Imaging was as 
previously described®. Imaging data was obtained using an Evolve 512 
EMCCD camera (Photometrics). Data was acquired at 10 Hz. A single 
field of view was chosen for recording and analysis from each mouse, 
each containing 80-150 segmented single neurons. 

To deliver intestinal stimuli for nodose calcium imaging, the duode- 
num was also exposed and catheterized as described above. A typical 
trial was 5 min long and consisted of a 60-s baseline (PBS 150 pil min”), 
a10-s (or 60-s) stimulus (200 pl min”), and a3-min washout period 
(120 s at 600 pl min“, 30 s at 1,800 pl min“, and 30s at 150 pl min”). 
Chemicals were dissolved in PBS: AceK, 30mM; glucose, 500 mM, MDG, 
500 mM; mannitol, 500 mM; galactose, 500 mM; 3-OMG, 500 mM. For 
SGLT1 blocker experiments, 8 mM phlorizin (Sigma) was dissolved in 
PBS with 3% w/v 1M NaOH (0.03 M NaOH final)"®, which was titrated 
back to pH 7.4. The blocker was used within 30 min of preparation. The 
intestines were pre-washed with PBS + phlorizin flowing at 150 pl min 
for 5 min before commencing the experiments, glucose 500 mM was 
diluted in PBS + 8 mM phlorizin. 

For retrograde labelling of vagal neurons from the duodenum, 
recombinant Alexa Fluor 594-conjugated CTB (Invitrogen C34777) 
was injected into the wall of the intestines. A total of 3 pl of 1O mg mI* 
(1%) CTB was injected across 10 sites in the duodenum (within 2. cm of 
the pylorus) using a 30° bevelled glass pipette connected to a Nanoli- 
tre 2010 microinjector (WPI). The pipette was inserted into the outer 
muscular layer of the intestines (that is, not luminally) at an acute angle. 
Mice were used for calcium imaging experiments 3-5 days after CTB 
injection. 


Calcium-imaging data collection and analysis 

Calcium-imaging data collected at 10 Hz was downsampled by a fac- 
tor of 3, and the images stabilized using the NoRMCorre algorithm®. 
Motion corrected movies were then manually segmented in Image] 
using the Cell Magic Wand plugin (https://www.maxplanckflorida.org/ 
fitzpatricklab/software/cellMagicWand/). Only ROIs whose average 
fluorescence was greater than the surrounding neuropil in more than 
10% of frames were used for further analysis. Neuropil fluorescence was 
subtracted from each ROI with the FISSA toolbox”, and neural activity 
was denoised using the OASIS deconvolution algorithm”. 

Neuronal activity was analysed for significant stimulus-evoked 
responses, as described in ref. *°, with the following modifications. To 
determine the baseline to calculate z-scores, traces were smoothed over 
a15-s moving window, and a baseline distribution of deviations from the 
median for each cell over the entire experiment was calculated using 


periods preceding the stimulus onset. This baseline was then used to 
calculate a modified z-score by subtracting the median and dividing by 
the median absolute deviation. Trials with an average modified z-score 
above 1.6 for the 90s following presentation of the stimulus were clas- 
sified as responding trials, and a cell was required to respond in more 
than 60% of stimulus trials to be classified as a responder. This criterion 
was validated against visual identification of responses by independent 
investigators and accurately identified >90% of the same cells with less 
than 5% false-positive rate. Only cells that reached at least 2% dF/F for 
the first two trials of glucose were included in heat maps. Heat maps for 
each experiment were normalized across stimuli, so different stimuli are 
directly comparable. We note that there were no significant numbers of 
MDG-only responses (~95% of the neurons that responded to MDGalso 
responded to glucose; a total of 168 MDG responders were analysed and 
159 showed responses to both). For the blocker and control data (Fig. 4, 
Extended Data Fig. 10) responses were filtered to ensure reliable trials 
preceding blocker addition (that is, the two responses before blocker 
addition had to be within 70% of each other). 


Chemogenetic-activation experiments 

For gain of preference experiments, Penk-cre animals were stereotaxi- 
cally injected with 300 nl of AAV carrying Cre-dependent activator 
DREADD (1-2 x 108 GC mI; AAV9 Syn-DIO-hM3Dq-mCherry, Addgene 
44361), bilaterally inthe cNST. At least 8 days was allowed for recovery 
and viral expression before behavioural testing. We note that in control 
studies, we validated that injections into the cNST did not infect vagal 
neurons. We examined six different ganglia with thousands of neurons 
and detected a total of only four labelled neurons (see also Fig. 3c for 
an example with AAV1). 

C57BL/6J and Penk mice expressing hM3Dq in the cNST were tested 
in atwo-bottle grape versus cherry flavour-preference assay. Grape 
solution was 0.39 gl Kool-Aid Unsweetened Grape (00043000955635) 
in] mM AceK in milliQ water; cherry solution was 0.9 g I" Kool-Aid 
Unsweetened Cherry (00043000955628) in2 mM AceK in milliQ water. 
For the first 48 h, animals were tested for their initial preference (Pre) 
between solutions. Mice were then exposed to cherry only for 2x 24h 
sessions (days 3 and 5), and grape plus 0.005 g I" clozapine dihydro- 
chloride (Hello Bio, HB6129-SOmg) for 2 x 24 h sessions (days 4 and 6) 
for conditioning. Mice were then assayed for their preference after 
the conditioning sessions on days 7 and 8. Initial preference is calcu- 
lated as the average of days 1-2 (volume of grape solution consumed)/ 
(total volume consumed), and post-conditioning preference is similarly 
calculated from days 7-8. To demonstrate that the switch is independ- 
ent of the nature of the initial less-preferred stimuli, 4/8 C57BL/6J and 
4/8 Penk-hM3Dq mice were tested with reversed flavour conditions 
(that is, conditioned to 0.9 gl cherry in1 mM AceK with clozapine). 


Brief-access preference assay 

C57BL/6J mice were tested for their immediate taste preferences ina 
short-access two-bottle preference assay’. Singly housed naive mice 
were acclimatized in new cages with access to two bottles of water 
overnight. Animals were then water deprived for 1h and presented with 
600 mM glucose versus 600 mM MDG for 1h. Preference for glucose was 
calculated as (volume of glucose consumed)/(total volume consumed). 


Insulin and glucose measurements 

Plasma insulin and blood glucose measurements were performed as 
previously described*’. Male C57BL/6J mice were group-housed in cages 
with wood chip bedding. Mice were habituated to scruffing and blood 
draws at least twice before the experiment. On the day of sample collec- 
tion, animals were subjected to a5-h fast (food removed and transferred 
toclean cages) beginning at the lights-on period of the light-dark cycle 
(07:00). Mice were gavaged with 555 mM glucose or MDG at 2mg kg”. 
Blood (-100 ul) was collected before and 15 min after gavage into a 
chilled heparin-coated Eppendorf tube. Glucose measurements were 


taken on whole blood via hand-held glucometer (OneTouch Verio). For 
insulin measurements, samples were run ona mouse insulin ELISA kit 
(Mercodia, 10-1247-01) according to the manufacturer’s directions. 


Retrograde labelling of vagal neurons from brainstem 

C57BL/6J mice were stereotaxically injected with 50 nl of red or green 
fluorescent RetroBeads (Lumafluor) in the cNST or Cuneate nucleus 
(Paxinos stereotaxic coordinates relative to Bregma and skull surface: 
caudal 7.5 mm, lateral 0.9 mm, ventral 3.6-3.9 mm). Mice were eutha- 
nized 6-7 days after RetroBeads injection. Prior to analysis, the brain- 
stem was sliced coronally to confirm accurate targeting to the cNST and 
Cuneate. Nodose and dorsal root ganglia (across the cervical, thoracic 
and lumbar segments)” were examined for fluorescent labelling. 


Statistics 

No statistical methods were used to predetermine sample size, and 
investigators were not blinded to group allocation. No method of 
randomization was used to determine how animals were allocated to 
experimental groups. Statistical methods used include one-way ANOVA 
followed by Tukey’s HSD post hoc test, two-tailed t-test, or the two-sided 
Mann-Whitney U-test, and are indicated for all figures. Allanalyses were 
performed in MATLAB. Data are presented as mean + s.e.m. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All data supporting the findings of this study are available from the 
corresponding author upon request. 


Code availability 
Custom code is available from the corresponding author. 
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Extended Data Fig. 1| Glucose and MDG preference. a, When mice are givena 
choice between 600 mM glucose or 600 mM MDG, using a brief-access (1h) 
test, naive animals display a small preference for glucose over MDG (n=5, 
two-tailed paired t-test, P= 0.0406), probably because MDG is slightly less 
sweet and thus not as attractive. Values are mean+s.e.m.b,c, Although the 
non-caloric sugar analogue MDGis very effective in causing a preference 
switch (see Fig. 1), it does not cause increases in plasma glucose or release of 
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insulin. Mice were gavaged with glucose or MDG, and plasma glucose and 
insulin levels were sampled before (Pre), and at 15 min after the gavage (Post). 
b, Plasma glucose after glucose gavage (red bars). n=7, two-tailed paired t-test, 
P=4x10°. Plasma glucose after MDG gavage (blue bars). n= 6, two-tailed 
paired t-test, P= 0.36.c, Plasma insulin levels after glucose gavage (red bars). 
n=7,two-tailed paired t-test, P=7 x 10~°. Plasma insulin levels after MDG gavage 
(blue). n= 6, two-tailed paired t-test, P=0.94. Values are mean+s.e.m. 
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Extended Data Fig. 2 | Fosresponses are robust and reliable. a, The brain represents a different animal, n=3 independent experiments. Note the 
diagram illustrates the position of the NST and the plane of the sectioning. robustness of the signals across animals. See Methods for details. b, Mice were 
ShownarecNST sections stained with Fos antibodies after exposing the stimulated with 600 mM 3-OMG (n=6 mice) or 600 mM galactose (n=3 mice) 
animals to 90 min of 600 mM sucrose, 600 mM glucose or 30 mM AceK. (see also Fig. 4, Extended Data Fig. 10). Note strong Fos signals incNST neurons, 
Each panel is aconfocal maximal projection image from Bregma-7.5mm n=2 independent experiments (total of 9 mice). Scale bars, 100 pm. 


consisting of 3 sections 15 pm apart. Each panel (sucrose, glucose or AceK) 
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Extended Data Fig. 3 | The development of sugar preference. a, Glucose demonstrating the robust development of preference for sugar versus artificial 
stimulates cNST neurons in mice lacking the sweet taste receptor (TIR2/3”), sweetener (see also Fig. 1). By contrast, silencing of the sensory neurons inthe 
orin mice lacking the TRPMS ion channel (TRPMS5” ). See Fig. le for nodose ganglia, by bilateral injection of AAV-DIO-TetTox into the nodose 


quantification. TIR2/3”, n=5 mice, ANOVA followed by Tukey’s HSD post hoc ganglia of Vglut2-cre mice (see Methods), abolishes the development of sugar 
test, P< 0.0001; TRPM5“,n=7 mice, ANOVA followed by Tukey’s HSD post hoc preference; n=3 mice, two-sided Mann-Whitney U-test, P= 0.035. Values are 


test, P< 0.0001. Values are mean + s.e.m. Scale bars, 100 pm. b, Direct mean +s.e.m.d, However, silencing vagal sensory neurons does not impair the 
intragastric infusion of glucose, but not AceK, robustly activatesthecNST.n=2 innate attraction to sweet solutions; shown are behavioural responses to AceK 
independent experiments. Scale bars, 100 pm. c, d, Genetic silencing of vagal versus water, and glucose versus water (n=3 animals, preference index for 


sensory neurons. c, Sugar-preference graphs for wild-type mice (n=5 mice), AceK = 0.82, preference index for glucose = 0.85). Values are mean +s.e.m. 
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Extended Data Fig. 4| Retrograde labelling from cNST. a, A fluorescent 
retrograde tracer (red RetroBeads, Lumafluor) was stereotactically injected 
into the cNST to label its inputs. The nodose ganglia and dorsal root ganglia 
were checked for transfer of the fluorescent label after 6-7 days. The nodose 
ganglion (vagal neurons), but not the dorsal root ganglion (spinal neurons), was 
robustly labelled®°. n=2 independent experiments. b, RetroBeads were also 
injected into the cuneate nucleus, a brainstem area near but distinct fromthe 
cNST. Vagal neurons were not labelled. By contrast, note robust labelling of 
spinal neurons (n=2 independent experiments). Nuclei were counterstained 
with DAPI (blue). Scale bars, 200 pm (Brainstem), 50 pm (nodose, DRG). 

c, Validation of TRAPing procedure to confirm that the sugar-activated cNST 
neurons marked by the expression of Fos are the same as the ones labelled by 


Cre recombinase in the genetic TRAPing experiments. We genetically labelled 
the sugar-induced TRAPed neurons witha Cre-dependent fluorescent 
reporter”, and then performed a second cycle of sugar stimulation followed by 
Fos antibody labelling. d, Top, neurons labelled by the Cre-dependent reporter 
after sugar TRAPing (‘sugar-TRAP’, pseudocoloured red) are the same as those 
labelled by Fos after a second cycle of sugar stimulation (‘sugar-Fos’, green; 

see Methods and text for details), >80% of Sugar-Fos neurons are also 
sugar-TRAP positive (n =7 animals). Middle, note that the few neurons labelled 
after water-TRAP in response to water do not overlap with those labelled with 
Fos antibodies after sugar stimulation. Bottom, the sugar-TRAP neurons are 
also activated by the non-caloric sugar analogue MDG; >80% of MDG-Fos are 
sugar-TRAP positive. Scale bar, 20 pm. 
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Extended Data Fig. 5| Mice witha silenced sugar-preference circuit behave 
as normal mice, drinking artificial sweeteners. a, A normal, non-thirsty 
mouse drinks about 5 ml of water during a 24-h window. n=11 mice. Values are 
mean +s.e.m.b, If presented with a sweet option (but not sugar, so as tonot 
create a preference) they showa small but significant increase in total volume 
consumed, but now most of the total consumption is from the sweet choice 
rather than water (n= 9 animals, two-tailed paired t-test, P=1 10). Values are 
mean +s.e.m.c, By contrast, ifthe options are water versus sugar, so that it 
creates a preference, they massively increase total volume consumed, and 
nearly all is from the sugar solution (n=9 animals, two-tailed paired t-test, 


P=3x10). Values are mean +s.e.m.d, As expected, wild-type controls 
develop astrong preference for sugar versus AceK (n=9 animals, two-tailed 
paired t-test, P=3 x 10°). Values are mean +s.e.m. e, f, Mice with the preference 
circuit silenced behave as control animals presented witha sweet, 
non-preference creating choice (compare e, f with b) (n=6 mice, two-tailed 
paired t-test, P=6 x 10* for AceK, P=4 x 10° for glucose). Values are 

mean +s.e.m. g, Silenced animals consumed nearly equal volumes of sugar and 
artificial sweetener (n= 6 animals, two-tailed paired t-test, P=0.1). Values are 
mean +s.e.m. 
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Extended Data Fig. 6| Vagal-neuron responses to sugar and MDGare highly 
reproducible and timed-locked to the stimulus. a, Shown are vagal-neuron 
responses to 6 consecutive 10-s intestinal stimuli of alternating trials with 

500 mM glucose and 500 mM MDG (stimulus delivery and timings are as 
described in the Methods). Each of the sample traces depicts the response from 
a different neuron. b, Shown are vagal-neuron responses to 5 consecutive 10-s 
intestinal stimuli with 5|00 mM glucose (stimulus delivery and timings are as 
described in the Methods). Each of the sample traces shows the response from 
adifferent neuron. c, Expanded time scale of responses to the 10-s 500 mM 
glucose stimulus from10s before to 10s after termination of the stimulus. The 
green dashed lines indicate the initiation of the stimulus, and the blue dashed 
lines denote termination of the 10-s stimulus. Calcium responses are shown in 
solid black and exponential fits to the response latency and kinetics are shown 
inred. Note responses time-locked to stimulus delivery; the top two traces 
depict two cells from two different mice in response to glucose, andthe bottom 
two traces depict two cells from two different mice in response to MDG; 
latencies varied between 3 and 6s, and were within the 10-s stimulation 
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window. Some cells exhibited longer latencies (see for example, heat maps in 
Fig. 4, Extended Data Fig. 8). We believe the cells with longer response latencies 
may represent intestinal glucose responders located farther down the 
intestinal segment, and thus would be expected to demonstrate longer 
latencies*’. d, On average, approximately 5% of vagal neurons respond reliably 
toal10-s 500 mM glucose stimulus. The histogram shows the percentage of 
GCaMP-expressing vagal neurons responding to the 10-s glucose stimulus. 
Average =4.6 + 0.05% (n=4,803 neurons from 51 ganglia, mean+s.e.m.).e, 
Recent findings” have suggested that appetitive behavioural responses are 
elicited through stimulation of vagal terminals originating from the right 
nodose ganglion. Shown are heat maps depicting z-score normalized average 
calcium responses of individual ganglion neurons after a 60-s pulse of 500 mM 
glucose. We observe no differences in responses to intestinal glucose from 
either the left or right vagal ganglia. Also shown are example traces from 
different neurons from the left and right Nodose ganglion; red bars indicate the 
60-s stimulus; scale bars indicate percentage maximal response. 
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Extended Data Fig. 7 | Vagal neurons innervating duodenal segment sense 
sugar.a, Top, schematic of retrograde tracing experiment. Fluorescently 
conjugated CTB** was injected into the proximal duodenum to back filland 
label the cell bodies of duodenum-projecting vagal neurons (z-projection of 
n=22 confocal planes froma representative ganglion, see Methods for details). 
The two bottom panels showa sample retrogradely labelled ganglion with 
sensory neurons (Vglut2-cre driving the GCaMP reporter) marked in green (left) 
and those labelled by CTB marked by red fluorescence (right). Double-positive 
neuronsare highlighted by the white circles. Scale bar, 100 pm. b, 
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Representative field of a vagal imaging session showing the overlay of CTB and 
GCaMP. Thetwo yellow circled neurons (denoted as #1 and #2) were labelled by 
retrogradely applied CTB in the duodenal segment, and exhibited strong 
responses to glucose (n= 16 ganglia from 10 mice). Scale bar, 100 pm.c, A total 
of 12 out of 55 double-positive neurons responded to the 10-s glucose stimulus 
(see Extended Data Fig. 6d for a comparison with uninjected animals). n=16 
ganglia from 10 mice. Note the substantial enrichment in the number of 
responders when pre-tagged by retrograde labelling: -20% in the duodenal 
tagged versus 4-5% in the whole population. 
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Extended Data Fig. 8 | Glucose responders are not sensing osmolarity. 
Williams et al.” identified vagal neurons that indiscriminately responded to 
highconcentrations of several stimuli delivered in very large stimulus volume 
for hundreds of seconds. We believe these responses, largely independent of 
the quality of the stimulus, are intestinal osmolarity signals. a, Shown are heat 
maps summarizing responses to interleaved 60-s stimuli of 500 mM glucose 
and 500 mM mannitol. Eachrow represents the average activity of a single cell 
during three interspersed exposures to the stimulus. Stimulus windowis 
indicated by the dashed white lines. Of 134 neurons that responded to 
intestinal application of 500 mM glucose for 60s, 101 did not exhibit 
statistically significant responses to mannitol (top). However, 33 (-25%) showed 
responses to both 500 mM glucose and 500 mM mannitol (bottom). n=5 mice. 
When theintestinal stimulus consisted of a short pulse (that is, 10 s; 33 pl 
volume) no responses were detected to 500 mM mannitol (data not shown). 

b, Sample traces (three trials each) of aneuron responding to glucose (red) but 
not mannitol (blue).c, Sample traces (three trials each) of aneuron responding 
to glucose and mannitol. Scale bars indicate percentage maximal response. 
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d, Heat maps showing responses toa 60-s stimuli of 1M mannitol, 1M fructose, 
1M mannose and1M NaCl. Note that the same cells respond indiscriminately to 
the various stimulus (n=4 mice). e, The graph shows preference plots for 
fructose versus AceK (n=8 mice, two-tailed paired t-test, P=0.27). Note that 
fructose, a caloric sugar, does not create preference, but activates osmolarity 
responses. f, Williams et al.7* suggest that GPR65-expressing vagal neurons 
function as the nutrient sensors. We generated mice in which GCaMP6s 
expression was targeted to GPR65-expressing vagal neurons and examined 
their responses toa 10-s stimulus of 500 mM glucose or osmolarity signals 
(that is, 1M each of fructose, mannose and NaCl for 60s). Shown are normalized 
responses of from three different mice to the four stimuli; each trace 
represents a different responding neuron. Note that 500 mM glucose for 10s 
does not activate GPR65 neurons. By contrast, they are activated by 60-s pulse 
of1M fructose, mannose and NaCl (see also Fig. 4).g, Summary histogram of 
GPR65 tuning profile to 10 s 500 mM glucose, and 60s1M fructose, 60s1M 
mannose and 60s1MNaCl;n=4 mice. 
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Extended Data Fig. 9 | Genetic silencing of GPR65 neurons does not affect 
the development of sugar preference. a, Global silencing of the GPR65 
neurons was achieved by generating GPR65-IRES-Cre; R26-TeNT double 
transgenic animals expressing TetTox in GPR65 neurons. Sugar-preference 
graphs demonstrating the robust development of preference for sugar versus 
artificial sweetener for both wild-type (n=5 mice, two-tailed paired t-test, 
P=0.0047) and GPR65:TetTox mice (n=5 mice, two-tailed paired t-test, 
P=0.0033). The wild-type controls shown here are the same mice used in 
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Extended Data Fig. 3c, as both sets of silencing experiments were carried out as 
part of the same series of studies. Values are mean +s.e.m. b, Silencing of GPR65 
neurons does not impair the innate attraction to sweet solutions. Shown are 
behavioural responses to AceK versus water and glucose versus water (n=5, 
two-tailed paired t-test, P= 0.0040 for consumed volumes of AceK versus 
water, P= 0.0023 for consumed volumes of glucose versus water). Values are 
mean +s.e.m. 
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Extended Data Fig. 10| Vagal neurons responding to intestinal glucose are 
also activated by SGLT1 agonists. a, Traces of vagal neurons responding toa 
10-s pulse of 500 mM intestinal glucose, also challenged with a10-s pulse of 
500 mM 3-OMG. Shownare sample neurons from 2 animals. b, Traces of vagal 
neurons responding toa 10-s pulse of S00 mM intestinal glucose, also 
challenged with a10-s pulse of 500 mM galactose. Shown are sample neurons 
from two animals for expanded time scales (from Fig. 4d).c, Traces of vagal 
neurons responding toa 10-s pulse of 500 mM intestinal glucose, also 
challenged with a10-s pulse of 500 mM fructose and 500 mM mannose. Shown 
are sample neurons from three mice. d, Traces of vagal neurons responding to 
two consecutive 10-s pulses of 500 mM intestinal glucose, before and after 
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treating the intestinal segment with 8 mM phlorizin for 5 min. Note the loss of 
responses. e, Because responses, in general, show some decay during the time 
of the experiment (in part due to desensitizing and bleaching of the fluorescent 
signals), we also analysed the average decay of corresponding glucose 
responses in the absence of any blocker. The graphs compare the loss of 
responses during normal decay, andin response to the blocker. For normal 
decay (left), n=11 neurons, Pre=230.8 arbitrary units (a.u.), Post = 172.8 a.u.; 
for blocker (right), n=31 neurons, Pre =229.7 a.u., Post = 67.0 a.u. All values are 
mean +s.e.m. Scale indicates average integral of the responses to the two trials 
before and after inhibition. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample size was determined based similar studies in the literature and our experience. No statistical method was used to determine the 
sample size prior to the study. 


Data exclusions Animals in which post-hoc histological examination showed that viral targeting or the position of implanted fiber were in the incorrect 
location were excluded from analysis. This exclusion criteria was predetermined. 


Replication We performed multiple independent experiments as noted in the figure legends. Results were reproducible. 


Randomization — Stimuli order was random, otherwise in situations as described in the manuscript where no randomization was used, the stimuli were 
interspersed and repeated among trials. 
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Blinding Investigators were not blinded to group allocation, as data analysis was performed automatically with the same scripts executed for each 
experimental group. 
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system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
L_| Antibodies [| ChIP-seq 
|__| Eukaryotic cell lines | Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used anti c-Fos (Santa Cruz, SC52G, K1715, Goat, 1:500), anti c-Fos (Synaptic Systems, 226004, Guinea Pig, 1:5000) 


Validation Both antibodies has been validated extensively, e.g. by immuno-staining on mouse brain sections (Choi, et al. Cell, 146, 
1004-1015, (2011), Song, et al. Science advances, 52: eaat3210, (2019)). 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Adult animals 6-24 weeks of age and from both genders were used in experiments. C57BL/6J (JAX #000664), ArcCreER (TRAP, 
JAX #021881), TRAP2 (JAX #030323), TRPM5 KO (JAX #013068), T1R2/T1R3 double KO (generated in house, JAX 
#013065/013066), Ai96 (JAX #028866), Vglut2-IRES-Cre (JAX #028863), Gpr65-IRES-Cre (JAX #029282), Penk-IRES2-Cre (JAX 
#025112), Ai75D (JAX #025106), R26-TeNT (MGI #3839913). 


Wild animals No wild animals were used. 
Field-collected samples No field-collected samples were used. 
Ethics oversight All procedures were carried out in accordance with the US National Institutes of Health (NIH) guidelines for the care and use of 


laboratory animals, and were approved by the Institutional Animal Care and Use Committee at Columbia University. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


7 12GO12Q 


8107 


Article 


Mechanisms and therapeutic implications of 
hypermutation in gliomas 


https://doi.org/10.1038/s41586-020-2209-9 


Received: 22 July 2019 


Accepted: 4 March 2020 


Published online: 15 April 2020 


® Check for updates 


Mehdi Touat'”***™, Yvonne Y. Li2**4, Adam N. Boynton2*, Liam F. Spurr?“, 

J. Bryan lorgulescu**, Craig L. Bohrson’”*, Isidro Cortes-Ciriano’, Cristina Birzu’, 

Jack E. Geduldig', Kristine Pelton’, Mary Jane Lim-Fat*”’, Sangita Pal”*, Ruben Ferrer-Luna 
Shakti H. Ramkissoon"”, Frank Dubois”*, Charlotte Bellamy', Naomi Currimjee’, 

Juliana Bonardi’, Kenin Qian®, Patricia Ho®, Seth Malinowski’, Leon Taquet', Robert E. Jones’, 
Aniket Shetty'’, Kin-Hoe Chow's, Radwa Sharaf", Dean Pavlick", Lee A. Albacker", 

Nadia Younan*®, Capucine Baldini“, Maité Verreault’®, Marine Giry”, Erell Guillerm’®, 

Samy Ammari"”’, Frédéric Beuvon’, Karima Mokhtari”°, Agusti Alentorn®, Caroline Dehais?, 
Caroline Houillier*, Florence Laigle-Donadey*, Dimitri Psimaras*, Eudocia Q. Lee*”°, 

Lakshmi Nayak*”, J. Ricardo McFaline-Figueroa*”’, Alexandre Carpentier”, Philippe Cornu”, 
Laurent Capelle”’, Bertrand Mathon”, Jill S. Barnholtz-Sloan”2, Arnab Chakravarti?°, 

Wenya Linda Bi”, E. Antonio Chiocca™, Katie Pricola Fehnel”, Sanda Alexandrescu”®, 

Susan N. Chi>’, Daphne Haas-Kogan”, Tracy T. Batchelor*”°, Garrett M. Frampton", 

Brian M. Alexander”, Raymond Y. Huang”, Azra H. Ligon®, Florence Coulet"®, 
Jean-Yves Delattre**°, Khé Hoang-Xuan*, David M. Meredith'®, Sandro Santagata 
Alex Duval*, Marc Sanson®*°, Andrew D. Cherniack’“, Patrick Y. Wen*”°, David A. Reardon’, 
Aurélien Marabelle™, Peter J. Park’, Ahmed Idbaih?, Rameen Beroukhim?*”"=™, 

Pratiti Bandopadhayay”>?”*°™, Franck Bielle?°*>~ & Keith L. Ligon'’?®"%7535= 


2,4,11 
, 


1,6,31,32 
, 


A high tumour mutational burden (hypermutation) is observed in some gliomas’ °; 
however, the mechanisms by which hypermutation develops and whether it predicts 
the response to immunotherapy are poorly understood. Here we comprehensively 
analyse the molecular determinants of mutational burden and signatures in 10,294 
gliomas. We delineate two main pathways to hypermutation: a de novo pathway 
associated with constitutional defects in DNA polymerase and mismatch repair 
(MMR) genes, and amore common post-treatment pathway, associated with acquired 
resistance driven by MMR defects in chemotherapy-sensitive gliomas that recur after 
treatment with the chemotherapy drug temozolomide. Experimentally, the 
mutational signature of post-treatment hypermutated gliomas was recapitulated by 
temozolomide-induced damage in cells with MMR deficiency. MMR-deficient gliomas 


were characterized by a lack of prominent T cell infiltrates, extensive intratumoral 
heterogeneity, poor patient survival and alow rate of response to PD-1 blockade. 
Moreover, although bulk analyses did not detect microsatellite instability in 
MMR-deficient gliomas, single-cell whole-genome sequencing analysis of 
post-treatment hypermutated glioma cells identified microsatellite mutations. 
These results show that chemotherapy can drive the acquisition of hypermutated 
populations without promoting a response to PD-1 blockade and supports the 
diagnostic use of mutational burden and signatures in cancer. 


Identifying genomic markers of response to immune checkpoint 
blockade (for example, PD-1 blockade) may benefit cancer patients 
by providing predictive biomarkers for patient stratification and 
identifying resistance mechanisms for therapeutic targeting. Glio- 
mas typically have a low tumour mutational burden (TMB) and a 
highly immunosuppressive microenvironment—two features associ- 
ated with immunotherapy resistance. Nevertheless, recent work has 
suggested that a subset of patients with high-TMB (hypermutated) 
gliomas might benefit from PD-1 blockade’®. Although consistent with 


data from other cancers’ ®, these initial observations were derived 
fromuniquediseasecontextssuchasconstitutional DNA mismatch-repair 
(MMR) deficiency syndrome’. Therefore, the extent to which 
glioma patients at large will benefit from this approach is unknown. 
While large amounts of genomic data on gliomas exist?*>"°"", our 
understanding of the clinical landscape of hypermutation and the 
mechanisms that underlie its development remain unclear. Hypermu- 
tation is rare in newly-diagnosed gliomas (de novo hypermutation), 
but common intumours that have recurred after the use of alkylating 
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agents (post-treatment hypermutation)**"°™. Given that gliomas 
exhibit substantial inter-patient and intra-tumoral genomic varia- 
tion’, it remains to be determined whether molecular biomarkers 
(for example, /DH1 or [DH2 (hereafter /DH1/2) mutations) reliably pre- 
dict the development of hypermutation or response to immunotherapy. 
An association between hypermutation and MMR mutations has 
been observed in gliomas’ *”, but most of the reported MMR muta- 
tions were not functionally characterized, and their role in causing 
hypermutation is unclear. Other studies have suggested that alkylating 
agents such as temozolomide are the direct cause of hypermutation’. 
This was supported by the discovery of a mutational signature (single 
base substitution (SBS) signature 11) characterized by the accumulation 
of G:C>A:T transitions at non-CpG sites in hypermutated gliomas after 
exposure to alkylating agents“. However, the fact that hypermutationis 
undetectable in most gliomas that recur after temozolomide treatment 
challenges this notion*>. Furthermore, it remains unclear whether this 
mutational pattern enhances tumour immunogenicity and renders 
gliomas responsive to PD-1 blockade. Not all hypermutated cancers 
respond to such treatments’ °; amore accurate characterization of the 
phenotypic and molecular features of hypermutated gliomas therefore 
would help clinicians to manage such patients more effectively. 


Mutational burden and signatures in gliomas 


Previous studies included too few hypermutated gliomas to 
characterize the landscape of hypermutationin gliomas! *. We therefore 
created a cohort of sufficient scale (n = 10,294) and subtype diversity 
by leveraging large datasets generated from clinical sequencing panels 
(DFCI-Profile, MSKCC-IMPACT and FMI)’*”. Allsamples from patients 
with a histopathological diagnosis of glioma were included and classi- 
fied into molecular subgroups according to histopathology, mutational 
status of [DH1/2, and whole-arm co-deletion of chromosomes 1p and 
19q (1p/19q co-deletion) (Extended Data Fig. 1, Supplementary Tables 1, 
2). We quantified the TMB of all samples (median 2.6 mutations (mut.) 
per Mb (range 0.0-781.3)), established thresholds for hypermutation 
by examining the distribution of TMB (Extended Data Fig. 2)'”'8, and 
identified 558 (5.4%) hypermutated gliomas (median TMB 50.8 mut. 
per Mb (8.8-781.3)) for further analysis. 

Using samples with detailed clinical annotation (DFCI-Profile), we 
found that the prevalence of hypermutation varied between and within 
subgroups (Fig. 1a, b, Extended Data Fig. 3a, b, Supplementary Table 3). 
Hypermutation was detected almost exclusively in diffuse gliomas 
(99.1% of hypermutated samples) with high-grade histology (95.6%) and 
was more prevalent in recurrent tumours (16.6% versus 2.0% in newly 
diagnosed tumours; Fisher’s exact test, P< 10") (Fig. 1b). In samples 
of recurrent tumours, hypermutation was associated with markers 
of response to alkylating agents, including /DH1/2 mutation (hyper- 
mutation in 1.4% of newly diagnosed versus 25.4% of post-treatment 
/DH1/2-mutant tumours, Fisher’s exact test, P= 2.0 x 107), 1p/19q 
co-deletion (0.0% versus 33.8%, P= 7.3 x 10), and MGMT promoter 
methylation (2.4% versus 24.2%, P= 9.0 x 10°”). The effect of IDH1/2 
mutation was confirmed only in MGMT-methylated tumours (Extended 
Data Fig. 3c). These findings suggest that selective pressure from ther- 
apy may elicit progression towards hypermutation. 

The standard treatment for gliomas includes surgery, radiation and 
chemotherapy with alkylating agents””°. To assess the role of each 
of these in the development of hypermutation, we analysed associa- 
tions between TMB and detailed patterns of treatment in 356 recurrent 
gliomas. Hypermutation was associated with prior treatment with 
temozolomide (Fisher's exact test, P< 10~) ina dose-dependent man- 
ner (Fig. 1b, Extended Data Fig. 3d, e), but not with radiation (P= 0.88) 
or nitrosoureas (P= 0.78). Among recurrent tumours from patients 
who had received only one adjuvant treatment modality, TMB was 
increased only intemozolomide-treated samples (median 16.32 (inter- 
quartile range (IQR) 6.95-70.32) versus 6.08 (3.80-7.97) with surgery 


518 | Nature | Vol580 | 23 April 2020 


only, P=4.0 x 107; Extended Data Fig. 3f). Of note, the prevalence of 
hypermutation in post-temozolomide samples correlated with the 
chemosensitivity of the primary, molecularly defined tumour type 
(1p/19q co-deleted oligodendrogliomas (59.5%) > /DH1/2-mutant astro- 
cytomas (30.2%) > MGMT-methylated /DH1/2 wild-type glioblastomas 
(23.1%) > MGMT-unmethylated /DH1/2 wild-type glioblastomas (5.6%); 
P=3.8 107; Fig. 1b). We observed a similar patternin the FMI validation 
dataset (Extended Data Fig. 3g-i). 

The systematic analysis of somatic mutation patterns by genome 
sequencing has identified a variety of mutation signatures in human 
cancer which are driven by known and unknown DNA damage and repair 
processes“. We examined the contributions of 30 previously reported 
signatures (COSMIC signatures v2) within our cohort to investigate 
the biological processes that cause hypermutation in gliomas. We first 
validated that mutational signatures can be predicted using large tar- 
geted panel sequencing in hypermutated samples (Extended Data 
Figs.4,5a-c). The majority of denovo hypermutated gliomas harboured 
mutational signatures associated with defects in the MMR pathway 
(COSMIC signatures 6, 15, 26 and 14) or the DNA polymerase POLE 
(10 and 14)" (69% and 35% of samples, respectively; Extended Data 
Fig. 5d, e), implying that constitutional deficiency in MMR or POLE was 
likely to be the underlying genetic cause of hypermutation. By contrast, 
98% of post-treatment hypermutated gliomas showed a mutational 
signature that has been previously associated with temozolomide expo- 
sure (signature 11). We also identified two distinct mutational signatures 
that were highly correlated with mutational signature 11 (Extended Data 
Fig. 5b, c) including a previously undescribed signature (S2) associated 
with 1p/19q co-deletion and lack of prior radiation therapy. Finally, 
half of the samples with a dominant signature 11 showed a co-existing 
minor MMR- or POLE-deficiency signature component (Extended Data 
Fig. 5e), suggesting that defective DNA repair and mutagen exposure 
cooperate to drive hypermutation in recurrent gliomas. 


Molecular drivers of hypermutation 


Only a subset of temozolomide-treated samples (58 of 225, 25.8%) 
showed evidence of hypermutation, suggesting that additional 
factors are required for its development. Although MMR defects 
have been consistently observed in hypermutated gliomas’ *”, their 
co-occurrence with high TMB did not enable prior studies to deter- 
mine the degree to which MMR mutations represent passenger versus 
hypermutation-causing driver events. We systematically characterized 
mutations and copy number variants (CNVs; Supplementary Figs. 1, 2) 
to identify hypermutation drivers using an unbiased approach that 
controlled for the increased incidence of passenger mutations associ- 
ated with hypermutation”. Inthe merged DFCI-Profile/MSKCC-IMPACT 
dataset, 36 genes were significantly enriched (g value < 0.01) in hyper- 
mutated tumours (Fig. 2a). Collectively, MMR mutations stood out 
among the most enriched (91.2% versus 4.9% in non-hypermutated sam- 
ples, g<1.6 x10), and mutations in MSH6 showed the highest enrich- 
ment (43.0% versus 1.2%, g=3.3 x 10°’) (Extended Data Figs. 3j-I, 6a, b). 
MMR-variant allele frequencies (VAFs) and cancer cell fractions (CCFs) 
in gliomas were most similar to those in MMR-deficient colorectal 
(CRC) or endometrial cancers and were higher than in MMR-proficient 
hypermutated cancers (Extended Data Fig. 6c, d). Some MMR variants 
in post-treatment hypermutated samples matched the canonical sig- 
nature 11 sequence context (Extended Data Fig. 5f), suggesting that 
a subset of these variants are likely to have been caused by temozolo- 
mide treatment. 

As most MMR variants lacked functional annotation, we next inte- 
grated sequencing data with immunohistochemistry for protein 
loss (Extended Data Fig. 6e). Overall, results from both assays were 
concordant, consistent with MMR mutations leading to loss of func- 
tion. Inrare samples that lacked MMR variants, signature analysis and 
MMR immunohistochemistry revealed evidence for MMR deficiency, 
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Fig. 1| TMB and mutational signature analysis reveals clinically distinct 
subgroups of hypermutated gliomas. a, Integrated analysis of the 
DFCI-Profile dataset (n = 1,628 gliomas) depicting TMB, indels at homopolymer 
regions, andthe single nucleotide variant (SNV) mutation spectrum in each 
tumour according to molecular status of DH1/2, 1p/19q co-deletion, 
chromosome 7 gain and/or chromosome 10 deletion (7gain/10del), MGMT 


suggesting that these samples harboured underlying MMR defects that 
could not be identified by sequencing (for example, promoter meth- 
ylation). We identified several MMR mutational hotspots (Extended 
Data Fig. 6f, Supplementary Table 4), including a recurrent MSH6 
mutation (p.T1219l, in 7.4% of hypermutated tumours) that has 
been previously identified in Lynch syndrome and shown to exert a 
dominant-negative effect without affecting protein expression”? 
(Extended Data Fig. 6g, h). 

Immunohistochemistry on an independent cohort of 213 recurrent 
post-alkylator gliomas further validated these findings (Supplementary 
Table 2). MMR protein expression was lost in 22 post-treatment sam- 
ples, and this loss was associated with /DH1/2 mutations (20% mutant 
versus 2% wild-type; Fisher’s exact test, P= 8.0 x 10) (Extended Data 
Fig. 7a, b). Sequencing of samples with MMR protein loss confirmed 
hypermutation, with MMR mutations in 18 of 19 (94.7%) of these sam- 
ples. Subclonal loss of MMR proteins (that is, protein retained inmore 
than 20% of tumour cells) was more common in post-treatment than 
de novo hypermutated gliomas (12 of 46 (26.1%) versus 0 of 16 (0.0%), 
P=0.03) (Extended Data Fig. 7c-f). 

We next assessed the relationship between MMR deficiency and 
acquired chemotherapy resistance. Because hypermutation and MMR 
defects were almost exclusively seen after temozolomide treatment, 
we hypothesized that nitrosoureas and temozolomide might not show 
complete cross-resistance. Analysis of temozolomide sensitivity in 
30 cell lines derived from patients with glioma (patient-derived cell 
lines, PDCLs), including four derived from MMR-deficient gliomas 
(Extended Data Fig. 8a—c), showed that all native MMR-deficient PDCLs 
had striking temozolomide resistance compared to MMR-proficient 
PDCLs (6.46- and 1.35-fold increase in median area under the curve 
(AUC) versus MMR-proficient-MGMT-deficient and MMR-proficient- 
MGMT-proficient PDCLs, respectively) (Fig. 2b, Extended Data Fig. 8d- 
f). We next treated native and engineered isogenic MMR-knockout 


promoter methylation, histological grade, age at initial diagnosis, and prior 
treatment. Red line denotes high TMB (217.0 mut. per Mb). b, Prevalence of 
hypermutation inthe DFCI-Profile dataset. Chi-squared test and two-sided 
Fisher’s exact test. NA, not available; TMZ, temozolomide; WT, wild-type; mut, 
mutant; codel, co-deleted. 


glioma models with temozolomide or the nitrosourea lomustine 
(CCNU), a chloroethylating alkylating agent that generates DNA 
interstrand crosslinks and double-strand breaks (Fig. 2c, Extended Data 
Fig. 8g-i). All MMR-deficient models were resistant to temozolomide 
and sensitive to CCNU, consistent with the lack of hypermutation in 
samples from nitrosourea-treated patients™ (Extended Data Fig. 3f). 


Mismatch repair deficiency and signature 11 


Our analyses indicated that MMR deficiency together with temozo- 
lomide exposure might cause signature 11, as opposed to it being 
a ‘pure’ temozolomide signature. To test this idea, we exposed iso- 
genic models of MMR deficiency to temozolomide (Extended Data 
Fig. 9a, b). After treatment with temozolomide, MMR-deficient PDCLs 
developed hypermutation with signature 11, whereas MMR-proficient 
controls (expressing sgGFP) did not (Fig. 2d). We then chronically 
treated temozolomide-sensitive glioblastoma xenografts (PDXs) with 
temozolomide until resistance was acquired (Fig. 2e, Extended Data 
Fig. 9c, d). These tumours developed hypermutation with signature 
11 (Fig. 2f, Extended Data Fig. 9e) and shared four unique variants; 
the dominant-negative MSH6 hotspot mutation (p.T12191) and three 
non-coding variants (Fig. 2g), consistent with the theory that the MSH6 
mutation drives both hypermutation and acquired temozolomide 
resistance (Extended Data Fig. 9f). 

Collectively, these findings show that temozolomide exerts a previ- 
ously underappreciated selective pressure in favour of MMR-deficient 
cells, which are resistant to temozolomide. Exposing MMR-deficient 
cells to temozolomide induces hypermutation with signature 11 by 
causing DNA damage in the absence of functional MMR. Therefore, 
hypermutation with signature 11 represents a potential biomarker for 
MMR deficiency and temozolomide resistance in gliomas (Extended 
Data Fig. 9g). 
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Fig. 2| MMR deficiency drives hypermutation and chemotherapy 
resistance in gliomas. a, Mutated genes and pathways enriched in 
hypermutated gliomas in the merged DFCI-Profile/MSKCC-IMPACT dataset 
(n=2,173) using a permutation test to control for random mutation rate in the 
setting of hypermutability. b, Response to temozolomide across a panel (n = 30) 
of native spheroid glioma PDCLs (blue, MMR-proficient; red, MMR-deficient). 
Dose-response curves were calculated using mean surviving fractions from 
three independent assays. c, Response to temozolomide and CCNU in the 
glioblastoma PDCL BT145 following knockout of MSH2, MSH6, MLH1 or PMS2by 
CRISPR-Cas9. Dose-response curves were calculated using mean surviving 
fractions from three independent assays (mean +s.e.m.).d, Number of signature 
11 variants after chronic temozolomide treatment of the PDCL DIPG13 with MSH2 
or MSH6 knockout by CRISPR-Cas9. Mutational signatures could not be called in 


Characteristics of MMR-deficient gliomas 


MMR deficiency recently emerged as an indicator of response to PD-1 
blockade in patients with cancer®”, leading to the first tissue-agnostic 
cancer-drug approval by the US Food and Drug Administration for use 
of the PD-1 blocker pembrolizumab in patients with MMR-deficient 
cancers. However, in CRCs and some other cancers, MMR inactivation 
occurs early intumour progression, whereas in post-treatment gliomas 
it arises late. Gliomas might therefore differ from other cancers on 
which the approval was based and these differences might influence 
immune recognition of tumours and the response to immunotherapy. 

To test this hypothesis, we first assessed the outcome of hypermu- 
tated gliomas. In CRC, MMR deficiency is associated with improved 
outcomes. By contrast, among patients with recurrent glioma, we 
observed worse survival in both hypermutated high-grade 1p/19q 
co-deleted oligodendrogliomas (median overall survival (OS) 
96.5 months (95% confidence interval (CI) 20.8-NA (not applicable)) 
versus 137.2 months (95% Cl 41.8-NA) innon-hypermutated tumours, 
P=0.0009, two-sided log-rank test) and /DH1/2-mutant astrocytomas 
(median OS 15.7 months (95% C112.9-18.3) versus 21.5 months (95% Cl 
19.2-29.8), P=0.0015) (Fig. 3a, Extended Data Fig. 10a-c). We observed 
asimilar trend in /DH1/2 wild-type glioblastomas (P= 0.0809). The find- 
ing of poor survival in recurrent hypermutated gliomas remained sig- 
nificant in multivariable analysis (hazard ratio 2.16 (95% C11.38-3.38), 
P=0.0008; Supplementary Table 5). 

The current hypothesis behind the response of MMR-deficient CRCs 
to PD-1 blockade is based on their increased neoantigen burden and 
immune infiltration. We therefore assessed the association between 
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the vehicle-treated samples (too few variants). e, Tumour volume (n= 8 mice per 
group) during treatment with vehicle (blue) or temozolomide (red) in BT145 
patient-derived xenografts (PDXs). f, Number of signature 11 variants found after 
chronic temozolomide exposure in BT145 PDXs. Mutational signatures could not 
be called in the vehicle-treated tumours (too few variants). g, Schematic 
representation of BT145 PDXs clonal evolution under temozolomide exposure. 
Two independent secondary resistant tumours (Resistant 1 and 2) and one 
vehicle-treated tumour are represented. Resistant tumours had four private 
variants that were not detected in the vehicle-treated tumour: an MSH6(T12191) 
mutation (VAF 0.27 and 0.37 for resistant 1 and 2, respectively), and three 
non-coding variants of NF1 (VAF 1.0 and 0.99), RAC] (VAF 0.86 and 0.86) and RAFI 
(0.44 and 0.56). HGG, high-grade glioma; Chr, chromosome. 


MMR deficiency and T-cell infiltration in gliomas (n = 43) and CRCs 
(n=19). As expected, MMR-deficient CRCs exhibited significantly more 
infiltrating T-cells than their MMR-proficient counterparts (Fig. 3b). 
By contrast, both MMR-deficient and MMR-proficient glioma samples 
lacked significant T-cell infiltrates (Fig. 3c). 

We next assessed whether the neoantigen burden was lower in 
MMR-deficient gliomas than in other hypermutated cancers using 
samples from the GENIE and TCGA datasets (n=1,748 and 699 hypermu- 
tated cancers, respectively). As neoantigen prediction was not feasible 
using panel sequencing data, we used the nonsynonymous mutational 
burden as a surrogate measure. This showed that both de novo and 
post-treatment MMR-deficient gliomas had an increase in their nonsyn- 
onymous mutational burden, when compared to non-hypermutated 
gliomas, and the glioma nonsynonymous mutational burden was simi- 
lar to other hypermutated cancers (Fig. 3d, Extended Data Fig. 11a, b, 
Supplementary Table 6). This finding suggested that the total number 
of neoantigens is unlikely to explain the differences in immune response 
between gliomas and other hypermutated cancers. 

Recent data suggest that, among mutations associated with MMR 
deficiency, small insertions and deletions (indels) at homopolymers 
(microsatellites)—which accumulate in MMR-deficient cells and can 
cause frameshift mutations—are crucial for producing ‘high-quality’ 
neoantigens that are recognized by immune cells”°”®. Unexpectedly, 
although the high TMB was associated with an increased homopolymer 
indel burden in MMR-deficient CRCs, this association was not found in 
MMkR-deficient gliomas (de novo hypermutated gliomas showed a mod- 
est increase; Fig. 3d, Extended Data Fig. 11c). This was validated using 
testing for microsatellite instability (MSI), aclinical biomarker for MMR 
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Fig. 3 |Hypermutated and MMR-deficient gliomas harbour unique 
phenotypic and molecular characteristics including poor outcome and 
lack of MSI in bulk sequencing. a, Survival of patients with recurrent 
high-grade glioma from the time of sample collection according to 
histomolecular group and TMB status (n= 333 recurrent samples; 238 

from DFCI-Profile, 95 from MSKCC-IMPACT). Two-sided log-rank test. 

b, Quantification of tumour-infiltrating CD3-positive T-cells in CRC samples 
(n=19). Left, representative low- and high-magnification images of CD3 
immunolabelling (brown; intraepithelial lymphocytes, black arrowheads; 
stromal lymphocytes, black arrows) and nuclear counterstaining (blue). 
Dashed lines, border between tumour and stroma. Only intraepithelial 
lymphocytes were quantified. Scale bars; 100 xm (100x), 50 um (200%). 
Right: boxes, quartiles; centre lines, median ratio for each group; whiskers, 
absolute range. Two-sided Wilcoxon rank-sum test. c, Quantification of 
tumour-infiltrating CD3-positive T-cells in gliomas according to their MMR 
status (n=43). For each group, three areas with the maximal CD3 infiltration 
were selected for quantification (representative images, left). Scale bars: 
500 pm (20x), 50 pm (200x). Right: boxes, quartiles; centre lines, median ratio 
for each group; whiskers, absolute range. Kruskal-Wallis test and Dunn’s 
multiple comparison test. d, TMB (top) and homopolymer indel burden 


deficiency. Whereas MSI was identified in all MMR-deficient CRCs, all 
tested gliomas with MMR protein loss (n=15) were microsatellite-stable 
(MSS) (Extended Data Figs. 7d-f, 11d). 

We hypothesized that, in hypermutated gliomas, more of the 
homopolymer indels are subclonal and below the detection limits 
of bulk sequencing, relative to other MMR-deficient cancers. Indeed, 
analysis of CCFs indicated that hypermutated gliomas contained a 
greater burden of subclonal variants than did other hypermutated 
cancers (Fig. 3e, Extended Data Fig. 11e—h). We therefore performed 
single-cell whole-genome DNA sequencing (scWGS) of 28 cells froma 
hypermutated, post-temozolomide glioblastoma with an MSH6(T12191) 


(bottom) inhypermutated gliomas compared with other hypermutated 
cancers from the GENIE dataset. Tukey’s boxplots are shown. Two-sided 
Wilcoxon rank-sum test with Bonferroni correction. e, Pan-cancer analysis of 
cancer cell fractions in hypermutated gliomas (post-treatment) compared with 
other hypermutated cancers from the TCGA and ref. * exome datasets (n=798). 
One hundred non-hypermutated samples from the TCGA were randomly 
selected as controls. Boxes, quartiles; centre lines, median ratio for each 
group; whiskers, absolute range excluding outliers. Two-sided Wilcoxon 
rank-sum test with Bonferroni correction. f, Workflow for scWGS and bulk 
tumour DNA sequencing. g, Single-cell sequencing estimate of the number of 
G:C>A:T transitions at NCC and NCT trinucleotide contexts in 63 cells froma 
glioblastoma patient with post-temozolomide hypermutation using 1x scWGS 
sequencing. Error bars show 95% Cl. The absolute computed purity was 0.66 
for the primary tumour sample and 0.47 for the recurrent tumour sample inthe 
bulk sequencing. h, Single-cell sequencing estimate of microsatellite mutation 
rate in eight cells froma patient with glioblastoma with post-temozolomide 
hypermutation. Eight cells were analysed for the presence of MSI using 

10x scWGS sequencing. WGA, whole genome amplification; QC, quality 
control; nucl, nuclei; seq, sequencing. 


mutation, and compared these to 35 non-hypermutated cells from 
the matched pre-treatment sample (Fig. 3f, Extended Data Fig. 11i-k). 
In the post-temozolomide sample, 13 of 28 cells (46.4%) were hyper- 
mutated with signature 11 (Fig. 3g, Extended Data Fig. 111). Strikingly, 
whereas this tumour harboured only a minor increase in its homopoly- 
mer indel burden at the bulk level (0.49 versus 0.0 per Mb), the scWGS 
analysis showed a ninefold increase in microsatellite mutations in all 
hypermutated cells (Fig. 3h). This suggested that glioma cells with 
an MSH6(T12191) variant harbour a subtle MSI phenotype that is not 
revealed by standard bulk sequencing or clinical MSI assays (Extended 
Data Fig. 11m). 
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Fig. 4 | Treatment of hypermutated gliomas with PD-1 blockade. a, b, Best 
radiological response (a, measured as the best change in the sum of the products 
of perpendicular diameters of target lesions), and overall survival (b) of 11 
patients with hypermutated and MMR-deficient gliomas who were treated with 
PD-1 blockade. A cohort of patients with non-hypermutated gliomas who were 
treated with PD-1 blockade is depicted as control (n=10, best matches according 
to diagnosis, primary versus recurrent status, and prior treatments). Two-sided 
log-rank test. c, Proposed model explaining differential response to PD-1 
blockade in MMR-deficient CRCs and gliomas. In CRCs (top), MMR deficiency is 
acquired early in pre-cancerous cells, creating mutations and indels at 
homopolymer regions. Over time, clonal neoantigens of both types emerge and 


PD-1 blockade in MMR-deficient gliomas 


As hypermutation in gliomas that acquire MMR deficiency tends to be 
subclonal and does not generate optimal antitumour T-cell responses, 
we hypothesized that these tumours might not have high response 
rates to PD-1 blockade. We performed a retrospective institutional 
review of patients treated with PD-1 pathway blockade for which the 
TMB at treatment initiation was available (n = 210). This identified 11 
patients with MMR-deficient glioma (5 de novo, 6 post-treatment) who 
were treated with PD-1 blockade for a median of 42 days (range 13-145; 
Supplementary Table 7). Nine (81.8%) had disease progression as their 
best response (Fig. 4a), and the median progression-free survival (PFS) 
and OS were 1.38 months (95% CI 0.95-2.69) and 8.7 months (95% Cl 
2.79-15.08), which were not significantly different from the data for 
matched patients with non-hypermutated glioma (PFS 1.87 months 
(95% C11.28-2.92), OS 9.96 months (95% Cl 7.56-15.08); Fig. 4b, Extended 
Data Fig. 10d). 

Because our prior analyses indicated that patients with hypermu- 
tated gliomas might have reduced survival, we used a second set of 
historical controls to compare the outcome of hypermutated glio- 
mas treated with PD-1 blockade versus other systemic agents (Supple- 
mentary Table 7). Unexpectedly, we observed a longer median OS for 
patients treated with other systemic agents when compared to those 
treated with PD-1 blockade (16.10 months (95% CI 3.98-22.21) versus 
8.07 (95% CI 2.79-15.08.21); P= 0.02, two-sided log-rank test; Extended 
Data Fig. 10e, f, Supplementary Table 8). In one patient with hypermu- 
tated glioma that showed rapid imaging changes, histopathologic 
analysis of samples taken before and after treatment with PD-1 blockade 
showed highly proliferative tumour in both samples, with no significant 
evidence of pathologic response or increase inimmune infiltrates after 
PD-1 blockade (Extended Data Fig. 10g). 


DISCUSSION 


Collectively, these results support a model in which differences in 
the mutation landscape and antigen clonality of hypermutated gli- 
omas relative to other hypermutated cancers markedly affect the 
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strong immune infiltrates are seen at diagnosis. Treatment with anti-PD-1 results 
in expansion of T cells that recognize these clonal neoantigens and substantial 
antitumour responses. In gliomas (bottom), few mutations are acquired early 
during tumorigenesis in the majority of tumours. Temozolomide drives the 
expansion of cells with MMR deficiency and late accumulation of random 
temozolomide-induced mutations. Ineffective antitumour responses may result 
from poor neoantigen quality (high burden of missense mutations versus 
frameshift-producing indels) and high subclonality associated with an 
immunosuppressive microenvironment. In some tumours, MMR-proficient 
subclones that have acquired therapy resistance through other pathways can 
co-exist with MMR-deficient subclones, giving rise to a mixed phenotype. 


response to immunotherapy (Fig. 4c) and may explain the lack of both 
recognition of MMR-deficient glioma cells by the host immune system 
and response to PD-1 blockade, compared to other MMR-deficient 
cancers®”, A key difference is that MMR-deficient gliomas lack detect- 
able MSI by standard assays, similar to data from patients with consti- 
tutional MMR deficiency syndromes”. Our scWGS analyses suggest 
that this discordance might be due to intratumour heterogeneity and 
alack of sufficient evolutionary time to select clonal MSI populations. 
Mechanistically, selective pressure exerted by temozolomide drives 
the late evolution of MMR-deficient subclones, which further accumu- 
late temozolomide-induced mutations in individual cells. Inline with 
previous data, therapy-induced single nucleotide variant mutations 
might not elicit effective antitumour responses, possibly because of 
the quality (missense mutations versus frameshift-producing indels) or 
subclonal nature of their associated neoantigens*”” ”. However, future 
evaluation of longer treatment exposure or combinatorial strategies is 
warranted to determine whether checkpoint blockade can be effective 
inthis or other selected populations (for example, individuals with 
newly diagnosed MMR- or POLE-deficient gliomas)°. 

We have presented evidence that recurrent defects in the MMR path- 
way drive hypermutation and acquired temozolomide resistance in 
chemotherapy-sensitive gliomas. Although it is difficult to determine 
the origin of MMR deficiency by sequence context alone in individual 
post-treatment samples, our data suggests that some MMR variants 
are likely to be caused by temozolomide. However, as acquired MMR 
deficiency occurs in the most temozolomide-sensitive tumours, it is 
not clear whether the acquired MMR deficiency outweighs the positive 
effects of temozolomide in gliomas. Our finding that MMR-deficient 
cells retain sensitivity to CCNU supports the hypothesis that hyper- 
mutation reduces cellular fitness and tolerance to DNA-damaging 
agents other than temozolomide. These alternatives are of interest in 
light of recent evidence showing that the addition of CCNU to chemo- 
radiation improves the outcome of patients with MGMT-methylated 
glioblastomas™. Future studies are warranted to address the possi- 
bility that upfront temozolomide with CCNU may attenuate the pro- 
cess of post-treatment hypermutation. Furthermore, mechanisms of 


resistance to temozolomide that are not associated with hypermutation 
will need to be addressed. 


Finally, our data indicate that the absence of animmune response in 


gliomas is likely to result from several aspects of immunosuppression 
in the brain that require further characterization. Approaches that 
increase infiltration by cytotoxic lymphocytes into the glioma micro- 
environment will probably be required to improve immunotherapy 
response. Our data also suggest a change in practice whereby repeated 
biopsies and sequencing to identify progression and hypermutation 
could inform prognosis and guide therapeutic management. 
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Methods 


Datasets 

For the DFCI-Profile dataset, clinical data and tumour variant calls 
identified through targeted next-generation sequencing (NGS) panels 
of 1,628 gliomas sequenced between June 2013 and November 2018 as 
part ofa large institutional prospective profiling program (DFCI-Profile) 
were included” (Extended Data Fig. 1). The distinction between pho- 
ton and proton radiotherapy was not systematically captured; the 
vast majority of patients underwent photon radiotherapy. For the 
MSKCC-IMPACT and FMI datasets, clinical data and tumour variant 
calls from a total of 545 and 8,121 samples, respectively, that could be 
assigned toa molecular subgroup (see below) were included*”*23, For 
pan-cancer analyses in targeted panel sets, clinical data and tumour 
variant calls from the GENIE project (a repository of genomic data 
obtained during routine clinical care at international institutions) were 
downloaded from Synapse (public data, release v6.1)**. For pan-cancer 
analyses in whole-exome sequencing sets, clinical data and tumour vari- 
ant calls from 17 hypermutated glioblastomas* and from the pan-cancer 
TCGA dataset were downloaded from the NCI Genomic Data Com- 
mons”. In addition, 247 gliomas collected at one site between 2009 
and 2017 were analysed for protein expression of four MMR proteins 
(MSH2, MSH6, MLH1, and PMS2) using immunohistochemistry. Writ- 
ten informed consent or IRB waiver of consent was obtained from all 
participants. Patients of the FMI dataset were not consented for release 
of raw sequencing data. The study, including the consent procedure, 
was approved by the institutional ethics committees (10-417/11-104/17- 
000; Western Institutional Review Board (WIRB), Puyallup, WA). 


Tumour genotyping and diagnosis 

For the majority of samples, genomic testing was ordered by the pathol- 
ogist or treating physician as part of routine clinical care to identify 
relevant genomic alterations that could potentially inform diagnosis 
and treatment decisions. Patients who underwent DFCI-Profile testing 
signed a clinical consent form, permitting the return of results from 
clinical sequencing. In total, 1,628 gliomas were sequenced as part of 
acohort of 21,992 tumours prospectively profiled between June 2013 
and November 2018. Research tumour diagnoses were reviewed and 
annotated according to histopathology, mutational status of /DH1 and 
IDH2 genes, and whole-arm co-deletion of chromosomes Ip and 19q 
(1p/19q co-deletion), according to WHO 2016 criteria”. All samples 
were assigned to one of four main molecular subgroups: /DH1/2-mutant 
and 1p/19q co-deleted oligodendrogliomas (high- and low-grade), 
IDH1/2-mutant astrocytomas (high- and low-grade), /DH1/2 wild-type 
glioblastomas (high-grade only), and /DH1/2 wild-type gliomas of other 
histologies (high- and low-grade), the latter including grade pilocytic 
astrocytomas, glioneuronal tumours and other unclassifiable gliomas. 
For simplification, /DH1/2 wild-type grade Ill anaplastic astrocytomas 
and grade IV diffuse intrinsic pontine gliomas were assigned to the 
group of [DH1/2 wild-type glioblastomas in all analyses. Samples for 
which the clinical diagnosis of glioma could not be confirmed (other 
histology or possible non-tumour sample) and five samples with miss- 
ing minimal clinical annotation were excluded from all analyses. For 
the MSKCC-IMPACT and FMI datasets, patients also signed a consent 
form, and samples were classified using the same procedure. MGMT 
promoter methylation status was determined as part of routine clini- 
cal care using chemical (bisulfite) modification of unmethylated, but 
not methylated, cytosines to uracil and subsequent PCR using primers 
specific for either methylated or the modified unmethylated DNA inthe 
CpG island of the MGMT gene (GenBank accession number AL355531 
nt46931-47011). 

Targeted panel next-generation sequencing (DFCI-Profile) was per- 
formed using the previously validated OncoPanel assay at the Center 
for Cancer Genome Discovery (Dana-Farber Cancer Institute) for 
277 (POPv1), 302 (POP v2), or 447 (POPv3) cancer-associated genes’**°, 


In brief, between 50 and 200 ng tumour DNA was prepared as previously 
described’, hybridized to custom RNA bait sets (Agilent SureSelect 
TM, San Diego, CA) and sequenced using Illumina HiSeq 2500 with 
2 x 100 paired-end reads. Sequence reads were aligned to reference 
sequence b37 edition from the Human Genome Reference Consortium 
using bwa, and further processed using Picard (version 1.90, http:// 
broadinstitute.github.io/picard/) to remove duplicates and Genome 
Analysis Toolkit (GATK, version 1.6-5-g557da77) to perform localized 
realignment around indel sites®*. Single-nucleotide variants were called 
using MuTect v1.1.4”, insertions and deletions were called using GATK 
Indelocator, and variants were annotated using Oncotator*®. Copy 
number variants and structural variants were called using the internally 
developed algorithms RobustCNV“!and BreaKmer® followed by manual 
review. To filter out potential germline variants, the standard pipeline 
removes SNPs present at >0.1% in Exome Variant Server, NHLBI GO 
Exome Sequencing Project (ESP), Seattle, WA (http://evs.gs.washington. 
edu/EVS/, accessed May 30, 2013), present in dbSNP, or present in an 
in-house panel of normal tissue, but rescues those also present in the 
COSMIC database”’. For this study, variants were further filtered by 
removing variants present at >0.1% in the gnomAD v.2.1.1 database 
or annotated as benign or likely benign in the ClinVar database***». 
Arm-level copy number changes were generated using an in-house 
algorithm specific for panel copy number segment files followed by 
manual expert review. We set acopy number segment mean log, ratio 
threshold at which we could accurately call arm amplification and dele- 
tion based on the average observed noise in copy number segments. 
Chromosome arms were classified as amplified or deleted if more than 
70% of the arm was altered. Asample was considered co-deleted if more 
than 70% of both 1p and 19q were deleted. 

Sequencing data from MSKCC-IMPACT were generated at the Memo- 
rial Sloan Kettering Cancer Center using a custom targeted panel cap- 
ture to examine the exons of 341 (IMPACT341) or 398 (IMPACT410) 
cancer-associated genes as previously described”. The FMI dataset 
comprised specimens sequenced as a part of clinical care using a 
targeted next-generation sequencing assay as previously described 
(FoundationOne or FoundationOne CDx, Cambridge, MA)’. Germline 
variants without clinical significance were further filtered by applying 
an algorithm to determine somatic or germline status**. Results were 
analysed for genomic alterations, TMB, MSI and mutational signa- 
tures. TMB was assessed by counting all mutations and then excluding 
germline and known driver mutations®“*”,. The remaining count was 
divided by the total covered exonic regions»*?. MSI status was deter- 
minedas previously described**. A log-ratio profile for each sample was 
obtained by normalizing the sequence coverage at all exons and ~3,500 
genome-wide SNPs against a process-matched normal control. This pro- 
file was corrected for GC-bias, segmented and interpreted using allele 
frequencies of sequenced SNPs to estimate tumour purity and copy 
number at each segment. Loss of heterozygosity (LOH) was called if 
local copy number was 1, or if local copy number was 2 with an estimated 
tumour minor allele frequency of 0%. To assess 1p/19q co-deletion, we 
calculated the percentage of each chromosome arm that was monoal- 
lelic (under LOH)**. Asample was considered 1p/19q co-deleted if both 
Ip and 19q were >50% monoallelic. 

For the DFCI-Profile and FMI datasets, the appropriate cutoffs for 
hypermutation (17.0 and 8.7 mut/Mb, respectively) were determined by 
examining the distribution of TMB in all samples and further confirmed 
using segmented linear regression analysis (Extended Data Fig. 2). For 
the MSKCC-IMPACT datasets, a threshold previously validated in this 
dataset was used”. In all analyses, the homopolymer indel burden was 
calculated by computing the number of single base insertions or dele- 
tions in homopolymer regions of at least 4 bases in length and dividing 
the count by the total exonic coverage as previously established”. 
Somatic variants were annotated as previously described ”°””, In 
addition, for the DFCI-Profile and MSKCC-IMPACT datasets, variants in 
aselected list of glioma-and DNA-repair associated genes (/DH1, IDH2, 


TERT, ATRX, CIC, H3F3A, HIST1H3B, EGFR, PDGFRA, FGFRI, FGFR2, FGFR3, 
MET, KRAS, NRAS, HRAS, BRAF, NF1, PTPN11, PTEN, PIK3CA, PIK3C2B, 
PIK3R1, CDKN2A, CDKN2B, CDKN2C, CDK4, CDK6, CCND2, RB1, TP53, 
MDM2, MDM4, TP53BP1, PPMID, CHEK1, CHEK2, RADS1, BRCA1, BRCA2, 
ATM, ATR, MLH1, MLH3, PMS1, PMS2, MSH2, MSH6, EPCAM, SETD2, POLE, 
POLD1, MUTYH, WRN) were manually reviewed for oncogenicity using 
several clinical databases for variant annotation (OncokB, ClinVar, 
COSMIC, ExAC, and ARUP). 


Mutational signature analyses 

All variants detected by the sequencing pipeline covered by at least 
30x read depth were stringently filtered for germline origin using the 
gnomAD (population allele frequency greater than 0.1%), and ClinVar 
(benign or likely benign annotation) databases**”, as well as manual 
review of VAF distributions and variants with VAFs consistent with 
possible germline origin (45-55% or over 95%). The mutational spec- 
trum of variants filtered during these steps was similar to a previously 
published germline mutation spectrum®. Signature analysis was per- 
formed for hypermutated samples ina two-step approach starting with 
the SomaticSignatures package in R for de novo signature extraction 
within each group”. To account for the inherent heuristic quality of 
the NMF approach, the NMF clustering step was repeated 100 times 
and chosen result was selected based on identifying signatures with 
the strongest Pearson’s correlation coefficients when compared to the 
30 well-established COSMIC signatures v2 (https://cancer.sanger.ac.uk/ 
cosmic/signatures_v2)"* (Extended Data Fig. 5a—c). We then used the 
DeconstructSigs package in R to estimate the contribution of identi- 
fied signatures using a regression model”. To account for the potential 
overfitting of aregression approach—owing to either lack of important 
signatures in the model, or inclusion of uninvolved signatures—we 
used only the signatures identified by the decomposition approach 
instep one, supplemented by any strong signature predictions identi- 
fied through a first pass run of DeconstructSigs with the 30 COSMIC 
signatures to check for samples that may show strong correlation to 
an outlier signature. For the FMI dataset, mutational signatures were 
called as previously described”. All point mutations were included in 
the analysis except known oncogenic driver mutations and predicted 
germline mutations. A sample was deemed to have a dominant signa- 
ture if a mutational signature had a score of 0.4 or greater. 

Toassess the ability ofthismethod to detect hypermutation-associated 
signatures in targeted panel sequencing data, we compared the signa- 
ture calls of exome-sequenced samples using all variants (previously 
published DeconstructSigs signature predictions”) versus using only 
variants that overlapped with the panel-targeted regions. Somatic vari- 
ant calls for bladder cancer, colon adenocarcinoma, rectal adenocar- 
cinoma, skin cutaneous melanoma, and lung adenocarcinoma (study 
abbreviations BLCA, COAD, READ, SKCM, LUAD) from the TCGA MC3 
dataset were used*’ to assess the detection of COSMIC mutational 
signatures associated with APOBEC (signatures 2 and 13), mismatch 
repair (signature 6), ultraviolet light (signature 7), POLE (signature 
10), and tobacco (signature 4). Variant calls for 17 hypermutated and 
12 non-hypermutated glioma exome-sequenced samples were used 
for assessing temozolomide (signature 11) detection*. There were two 
COAD samples with known POLE exonuclease domain oncogenic muta- 
tions anda POLE signature predicted by DeconstructSigs; these were 
used for assessing POLE signature detection. For a given threshold 
number of variants (X'), we considered how many samples had at least 
X' variants, and what percentage of these samples could correctly pre- 
dict the exome-based signature using panel-restricted variants (witha 
predicted signature fraction greater than 0.1-0.2). This analysis showed 
that panel-based signature calls for the APOBEC, mismatch repair, 
tobacco, and ultraviolet light signatures reached 90% sensitivity with 
at least 20 somatic variants. Owing to the low number of samples with 
POLE-associated and temozolomide-associated hypermutation, we did 
not assess the sensitivity of signature detection at each variant count 


threshold; we instead downsampled the number of variants in positive 
control samples to find the minimum number of variants necessary to 
reproducibly predict the known signature, which was also determined 
to be 20 somatic variants (Extended Data Fig. 4). 


Enrichment analysis 

Mutation enrichment was statistically determined through a permuta- 
tion test to control for confounders including variable mutability of 
different genes as well as sample mutation rates, which is of particular 
importance when assessing enrichment in hypermutated samples. 
First, we generated a list of every mutation in each of our samples. 
We calculated the difference in the mutation counts (A’) between the 
group of interest and the reference group. We then randomly permuted 
the mutations 100,000 times, preserving sample and gene mutation 
counts, and computed the A for each gene in each permutation. The P 
value for a given gene was determined by the fraction of permutations 
1-n (in our case, n= 100,000) for which A, > A’. Storey g values were 
generated using the qvalue package in R to adjust for multiple compari- 
sons. The analysis was first performed in the merged DFCI-Profile and 
MSKCC-IMPACT dataset, and further revalidated in the FMI dataset in 
an independent analysis. 


Single-cell whole-genome sequencing 

Frozen glioma samples were mechanically dissociated into pools of 
single nuclei as previously described™, following which single nuclei 
were isolated by flow cytometry, using a DAPI-based stain. Nuclei were 
subjected to whole-genome multiple displacement (MDA) amplifica- 
tion (Qiagen, REPLI-g) followed by next-generation sequencing library 
construction for Illumina Sequencing (Qiagen QIAseq FX DNA library 
kit). Libraries were sequenced on the Illumina HiSeq platform in paired 
end mode. Single cells were sequenced to 0.1-1* coverage. Bulk pooled 
nuclei were sequenced to 60x coverage while matched germline DNA 
(extracted from blood) was sequenced to 30x coverage. 

Reads were aligned to hg38 using bwa mem, and variants were 
jointly called across bulk normal tissue, primary tumour single cells, 
and recurrent tumour single cells using the GATK best practices pipe- 
line** without variant quality score recalibration. Somatic mutations 
insingle cells were called if they were monoallelic, had ahomozygous 
reference genotype call but no alternate-allele support in bulk normal 
tissue, and had at least three supporting reads ina single cell. Germline 
heterozygous mutations (gHets) were called ifthey were monoallelic, 
were found in dbSNP (version 138, http://www.ncbi.nlm.nih.gov/snp), 
and had at least one supporting read and a heterozygous genotype 
call in bulk normal tissue. To assess sensitivity in each single cell, we 
computed the fraction of gHets detected with at least three supporting 
reads, analogous to our procedure for calling somatic mutations. To 
estimate the total number of somatic mutations present in each cell, we 
divided the total number of somatic mutations detected by sensitivity. 
To obtain 95% confidence intervals on the total mutational burden, 
we modelled the measurement of sensitivity using a beta distribu- 
tion with Jeffrey’s prior, in which the beta parameters (a, B) are equal 
to the number of detected gHets + 0.5 and the number of undetected 
gHets + 0.5, respectively. We identified recurrent tumour single cells 
as hypermutated if their mutational burden was at least 1.5 times the 
highest mutational burden detected among primary tumour cells. 

The method to detect microsatellite mutations was based on 
read-based phasing and was previously validated using scWGS data 
fromneurons (L.C.-C. et al., manuscript in preparation). First, the human 
genome was scanned to definea reference set of microsatellite repeats 
that can be captured using short reads (that is, between 6 and 60 bp) as 
previously described*’. Heterozygous SNPs were then detected in the 
bulk normal sample using the variant caller GATK®®. Next, the readsin 
a given cell mapping to each heterozygous SNP allele detected in the 
bulk sample and their mates were extracted. If any of the microsatel- 
lites in the reference set were covered by these reads, the distribution 
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of the allelic repeat lengths supported by the data was obtained by 
collecting the lengths of allintra-read microsatellite repeats mapped 
tothe microsatellite locus under consideration. To discount truncated 
microsatellite repeats, we required 10-bp flanking sequences (both 5’ 
and 3’) of the intra-read microsatellite repeats to be identical to the 
reference genome. The same procedure was applied to the bulk sample. 
Finally, the distributions of microsatellite lengths from the single cell 
and the bulk sample were compared using the Kolmogorov-Smirnov 
test. The rates of microsatellite instability for each cell were computed 
as the number of sites mutated divided by the total number of microsat- 
ellites for which a call could be made. We applied FDR correction using 
0.05 as a threshold for statistical significance, with a minimum of 
8 single cell and 10 normal reads required to make a call. Allthe codeis 
publicly available (https://github.com/parklab/MSIprofiler). 


Immunohistochemistry 

For the revalidation of MMR defects in an independent set, all prospec- 
tively collected surgical samples representing consecutive relapses 
of diffuse glioma following treatment with alkylating agents in adult 
patients (surgery between 2009 and 2015) were included. An expert 
neuropathologist reviewed histological samples from the IHC Pitié 
Salpétriére cohort (Supplementary Table 2) in order to assess the WHO 
2016 integrated diagnosis and to select the tumour areas for immuno- 
histochemistry and for DNA extraction when molecular testing from 
formalin-fixed paraffin-embedded (FFPE) tissue material was required. 
Diffuse gliomas harbouring unambiguous positive IDH1(R132H) immu- 
nostaining were classified as /DH1/2-mutant. /DH1/2 status was tested 
by targeted sequencing in all diffuse gliomas harbouring negative or 
ambiguous IDH1(R132H) immunostaining. /DH1/2-mutant diffuse glio- 
mas with loss of ATRX expression in tumour cells were classified as non 
1p/19q co-deleted. 1p/19q co-deletion was tested in all /DH1/2-mutant 
diffuse gliomas with maintained ATRX expression. MGMT status was 
assessed in /DH1/2 wild-type gliomas. FFPE sections (3 um thick) were 
deparaffinized and immunolabelled with a Ventana Benchmark XT 
stainer (Roche, Basel, Switzerland). The secondary antibodies were 
coupled to peroxidase with diaminobenzidine as brown chromogen. 
For immunohistochemistry performed at Pitié-Salpétriére (PSL) 
Hospital, the following antibodies were used: mouse monoclonal 
anti-ATRX (Bio SB, clone BSB-108, BSB3296, 1:100), mouse monoclo- 
nal anti-IDH1(R132H) (Dianova, clone HOY, DIA-HO9, 1:100), rabbit 
monoclonal anti-CD3 (Roche, clone 2GV6, 790-4341, prediluted), rabbit 
polyclonal anti-IBA1 (Wako, W1W019-19741, 1:500), mouse monoclonal 
anti-MLHI1 (Roche, clone M1, 790-4535, prediluted), mouse monoclonal 
anti-MSH2 (Roche, clone G219-1129, 760-4265, prediluted), mouse 
monoclonal anti-MSH6 (Roche, clone 44, 760-4455, prediluted), rabbit 
monoclonal anti-PMS2 (Roche, clone EPR3947, 760-4531, prediluted). 
For immunohistochemistry performed at BWH, the following anti- 
bodies were used: mouse monoclonal anti-MLHI1 (Leica, clone ESOS, 
MLH1-L-CE, 1:75), mouse monoclonal anti-MSH2 (Merck Millipore, clone 
Ab-2-FE11, NA27, 1:200), mouse monoclonal anti-MSH6 (Leica, clone 
PU29, MSH6-L-CE, 1:50), mouse monoclonal anti-PMS2 (Cell Marque, 
MRQ-28, 288M-14-ASR, 1:100). An expert neuropathologist blinded to 
the molecular status of MMR deficiency analysed the immunostaining. 
If loss of expression of one or several MMR proteins was observed in 
tumour cells, this result was confirmed in an independent laboratory 
by asecond expert pathologist with separate stainer and reagents: 
FFPE sections were immunolabelled with a BOND stainer (Leica, Wet- 
zlar, Germany). Primary antibodies were as follows: mouse monoclo- 
nal anti-MLH1 (clone G168-728, BD Pharmingen), mouse monoclonal 
anti-MSH2 (clone 25D12, Diagnostic BioSystems), mouse monoclonal 
anti-MSH6 (clone 44, Diagnostic BioSystems), mouse monoclonal 
anti-PMS2 (clone A16-4, BD Pharmingen). The loss of expression of 
MMR proteins was defined as the total absence of nuclear labelling in 
tumour cells associated with a maintained expression in normal cells 
(as a positive internal control in the same tissue area). The density of 


the immune infiltrate was studied after immunolabelling of T lympho- 
cytes by CD3 and of macrophage/microglial cells by IBA1. The num- 
ber of immunopositive cells was quantified by visual counting in the 
three areas (one square millimetre) of tumour tissue containing the 
highest density of immunopositive cells and a mean density was cal- 
culated. 


Patient-derived cell lines 

All PDCLs with a name starting with BT were established from tumours 
resected at Brigham and Women’s Hospital and Boston Children’s 
Hospital (Boston, MA) and were maintained in neurosphere growth 
conditions using the NeuroCult NS-A Proliferation Kit (StemCell 
Technologies) supplemented with 0.0002% heparin (StemCell Tech- 
nologies), EGF (20 ng/ml), and FGF (10 ng/ml; Miltenyi) ina humidified 
atmosphere of 5% CO, at 37 °C. The N16-1162 PDCL was established by 
the GlioTex team (Glioblastoma and Experimental Therapeutics) at the 
Institut du Cerveau et dela Moélle épiniére (ICM) laboratory and main- 
tained as described above. SU-DIPG-XIII (DIPG13) cells were provided 
by Dr. Michelle Monje at Stanford University and were maintained in 
neurosphere growth conditions in a humidified atmosphere of 5% CO, 
at 37 °Cin tumour stem medium (TSM) consisting of Dulbecco’s modi- 
fied Eagle’s medium: nutrient mixture F12 (DMEM/F12), neurobasal-A 
medium, HEPES buffer solution 1 M, sodium pyruvate solution 100 nM, 
non-essential amino acids solution 10 mM, Glutamax-I supplement 
and antibiotic-antimycotic solution (Thermo Fisher). The medium was 
supplemented with B-27 supplement minus vitamin A, (Thermo Fisher), 
20 ng/ml human-EGF (Miltenyi), 20 ng/ml human-FGF-basic (Miltenyi), 
20 ng/ml human-PDGF-AA, 20 ng/ml human-PDGF-BB (Shenandoah 
Biotech) and 2 pg/ml heparin solution (0.2%, Stem Cell Technologies). 
The identity of all cell lines established was confirmed by short tandem 
repeat assay or sequencing. All cell lines were tested for the absence 
of mycoplasma. Cell lines, xenografts, and model data available from 
the DFCI Center for Patient Derived Models. 


Viability assays 

For short-term viability assays, cells were plated in 96-well plates and 
treated the following day with temozolomide (Selleckchem) or CCNU 
(Selleckchem) for 7-9 days incubation. Fresh medium was added after 
four days of incubation. Cell viability was assessed using the lumines- 
cent CellTiter-Glo reagent (Promega) according to the manufacturer’s 
protocol. Luminescence was measured using the Modulus Microplate 
Reader (Promega). The surviving fraction (SF) for each [X] concentra- 
tion was calculated as SF = mean viability in treated sample at concentra- 
tion [X]/mean viability of untreated samples (vehicle). Dose-response 
curves and IC,,. were generated using Prism 8 (GraphPad Software, 
San Diego, USA) after log transformation of the concentrations. Curves 
were extrapolated using nonlinear regression with four-parameter 
logistic regression fitting on triplicates from survival fractions of three 
independent replicates, following the model: y= Bottom + (Top - Bot- 
tom)/(1+10([logIC;, - x] x HillSlope)). 


Generation of Isogenic MMR- deficient cell lines 

Oligos of the form 5’-CACCG[N20] (where [N20] is the 20-nucleotide tar- 
get sequence; sgGFP, GAGCTGGACGGCGACGTAAA;sgMSH2, ATTCTGTT 
CTTATCCATGAG; sgMSH6, TTATTGGAGTCAGTGAACTG; sgMLH1, 
ACTACCCAATGCCTCAACCG; sgPMS2, TCACTGCAGCAGCGAGTATG) 
and 5’-AAAC[rc20]C (where [rc20] is the reverse complement of [N20]) 
were purchased from Integrated DNA Technologies (IDT). For DIPG13 
cells, oligos containing the sgRNA target sequence were annealed 
with their respective reverse complement and cloned into the len- 
tiCRISPR all-in-one sgRNA/Cas9-delivery lentiviral expression vector 
(pXPR_BRDOOI; now available as pXPR_BRDO23 lentiCRISPRv2) from 
the Broad Institute Genetics Perturbation Platform (GPP). For BT145 
cells, oligos containing the sgRNA target sequence were annealed with 
their respective reverse complement and cloned into the pXPR_BRDOS1 


CRISPRko all-in-one sgRNA/Cas9-delivery lentiviral expression vec- 
tor (available from the Broad Institute Genetics Perturbation Plat- 
form, GPP). Successful cloning of each sgRNA target sequence was 
confirmed via Sanger Sequencing. To generate lentivirus from these 
vectors, HEK293T cells were transfected with 10 pg of each expression 
plasmid with packaging plasmids encoding PSPAX2 and VSVG using 
lipofectamine. Lentivirus-containing supernatant was collected 48 
and 72 h after transfection. DIPG13 and BT145 cells were seeded ina 
12-well plate at 1-3 x 10° cells/well in 3 ml medium and spin-infected 
(2,000 rpm for 2 h at 30 °C with no polybrene) with pLX311-Cas9 
(DIPG13) or pXPR_BRDOS1 (BT145) lentiviral vectors and selected with 
blasticidin (10 pg/ml, DIPG13) or hygromycin (300 pg/ml, BT145) 
to generate Cas9-expressing or knockout cells. DIPG13-Cas9 cells 
underwent a subsequent lentiviral spin-infection with the lentiCRISPR 
sgGFP, sgMSH2, or sgMSH6 vectors described above. Puromycin selec- 
tion (0.4 pg/ml for DIPG13 cells) commenced 48 h post-infection. 


Chronic treatment and sequencing of isogenic MMR-deficient 
celllines 

DIPG13-sgGFP, -sgMSH2, and -sgMSH6 cells were seeded at 8 x 10° 
cells/well in 4 ml medium ina 6-well ULA plate. Each line was grown for 
3 months under 3 conditions: no treatment, temozolomide (100 uM, 
Selleckchem), or DMSO vehicle. Cells were grown under these con- 
ditions in the absence of both blasticidin and puromycin. Cells were 
re-dosed with temozolomide or DMSO every 3-5 days, splitting 
over-confluent cells 1:2 or 1:4 as needed. After 3 months, genomic DNA 
was extracted using the QlAmp DNA Mini Kit. DNA was subjected to 
whole-exome Illumina sequencing. Reads were aligned to the Human 
Genome Reference Consortium build 38 (GRCh38). WES data were 
analysed using the Getz Lab CGA whole-exome sequencing charac- 
terization pipeline (https://docs.google.com/document/d/1VO2kX_fgf 
Ud0Ox3mBS9NjLUWGZu794WbTepBel3cBg08/edit#heading=h.yby87 
12ztbcj) developed at the Broad Institute which uses the following 
tools for quality control, calling, filtering and annotation of somatic 
mutations and copy number variation: PicardTools (http://broadinsti- 
tute.github.io/picard/) ContEst*®, MuTect1”, Strelka®, Orientation Bias 
Filter®°, DeTiN®, AllelicCapSeg, MAFPoNFilter®, RealignmentFilter, 
ABSOLUTE, GATK*S, Variant Effect Predictor®, and Oncotator*. 


Subcutaneous xenografts and drug treatment 

BT145 cells (2 x 10°) were resuspended in equal parts Hank’s buffered 
salt solution (Life Technologies) and Matrigel (BD Biosciences) and 
then injected into both flanks of eight-week-old NU/NU male mice 
(Charles River). Tumour-bearing mice (n= 8) were randomly assigned 
to the treatment or vehicle arm when tumours measured a volume 
of 100 mm’. Animals received 12 mg/kg/day temozolomide or vehi- 
cle (Ora-Plus oral suspension solution, Perrigo, Balcatta, Australia) 
by oral gavage for 5 consecutive days per 28-day cycle. An additional 
4 weeks resting period without treatment was observed before the 
second cycle. Tumour volumes were calculated using the formula: 
0.5 x length x width”. Body weights were monitored twice weekly. The 
investigators were not blinded to allocation during experiments and 
outcome assessment. Mice were euthanized when they showed signs 
of tumour-related illness or before reaching the maximum tumour bur- 
den. Tumours were subsequently removed, anda subset were submitted 
to Oncopanel sequencing for analyses of exonic mutations (POPv3, 447 
genes) and mutational signature as defined above. To separate human 
and mouse sequenced reads in the DNA sequencing data generated for 
the PDX models, the ‘raw’ data were mapped to both the hg19 human 
and mm10 mouse reference genomes using BWA-MEM-0.7.17. The out- 
put of the alignment was name sorted by Samtools-1.7. We then used 
the software package Disambiguate (ngs_disambiguate-1.0) to assign 
each read to the human or mouse genome and to produce final align- 
ment files in BAM format. Final hg19 BAM files were coordinate sorted 
by Samtools-1.7. Duplicate reads were marked and removed from the 


BAM files using Picard-2.0.1. GATK4.1.0.0 base recalibration was per- 
formed using BaseRecalibration and Applying Recalibration followed 
by CollectF1R2Counts and LearnReadOrientationModel to create a 
model for read orientation bias. Variant calling was performed using 
GATK-4.1.0.0/Mutect2 pipeline with the default parameters and filters 
except for the following modifications: (i) ‘af-of-alleles-not-in-resource’ 
was set to O; (ii) ‘MateOnSameContigOrNoMappedMateReadFilter’ was 
disabled; (iii) the output of Step8 was used for fitting the read orienta- 
tion model; and (iv) agermline resource from the gnomAD database was 
included (https://console.cloud.google.com/storage/browser/_details/ 
gatk-best-practices/somatic-b37/af-only-gnomad.raw.sites.vcf). The 
capture targets intervals used for Mutect2 were POPv3. The generated 
variant calls were further filtered using the FilterMutectCalls mod- 
ule of GATK4.1.0.0 and the final output in VCF format was annotated 
with Ensemble Variant Effect Predictor (ensembl-vep-96.0) using vcf- 
2maf-1.6.16. The calls were additionally annotated with the OncoKB 
dataset using oncokb-annotator and sorted as MAF files. 

Allin vivo studies were performed in accordance with Dana-Farber 
Cancer Institute animal facility regulations and policies under protocol 
number 09-016. 


Immunoblotting 

Proteins were extracted in lysis RIPA buffer (SO mM Tris, 150 mM NaCl, 
5mMEDTA, 0.5% sodium deoxycholic acid, 0.5% NP-40, 0.1% SDS) sup- 
plemented with protease inhibitor cocktail (Roche Molecular). Proteins 
were quantified using the PierceBCA Protein Assay Kit, according to the 
manufacturer’s protocol. Samples were then prepared with 1x NUPAGE 
(Invitrogen) LDS sample buffer, and NuPAGE (Invitrogen) sample reduc- 
ing agent followed by heating to 95 °C for 5 min. The samples were 
then loaded onto NuPAGE 4-12% Bis-Tris Gel (Invitrogen) with NUPAGE 
MOPS SDS (Invitrogen) buffer and run through electrophoresis. The 
transfer onto membrane was then done at 40 V overnight at 4 °C in 
NuPAGE transfer buffer (Invitrogen) with 10% methanol. Membranes 
were blocked with 5% skim milk in TBST for 1h, then incubated with the 
following primary antibodies added to 5% BSA and incubated overnight 
at 4 °C onashaker: mouse monoclonal anti-MGMT (Millipore, MT3.1, 
MAB16200, 1:500), mouse monoclonal anti-MSH2 (Calbiochem, FE11, 
NA27, 1:1,000), mouse monoclonal anti-MSH6 (Biosciences, 44, 610918, 
1:500), mouse monoclonal anti-MLHI (Cell Signaling, 4C9C7, 3515, 
1:500), mouse monoclonal anti-PMS2 (BD Biosciences, A16-4, 556415, 
1:1,000), mouse monoclonal anti-beta-actin (Sigma, AC-74, A2228, 
1:10,000). After several cycles of washing and incubation with second- 
ary goat anti-mouse antibody (Invitrogen 31430, 1:10,000), membranes 
were imaged by chemiluminescence using the Biorad ChemidocTM 
MP imaging system. 


Microsatellite instability analysis 

PCR amplification of the five mononucleotide markers (BAT25, BAT26, 
NR21, NR24, MONO27) was performed with the MSI Analysis System kit 
(Version 1.2, Promega). PCR products were analysed by an electropho- 
retic separation on the polymer POP7 50cm in an Applied Biosystems 
3130XL sequencer and using Genemapper Software 5. 


Outcome of patients treated with PD-1 blockade 

For comparison of PFS and OS in patients treated with PD-1 pathway 
blockade according to TMB and MMR statuses, we retrospectively 
identified patients with glioma who had been treated with PD-1 block- 
ade (alone or in combination with bevacizumab) for recurrent disease 
at our institutions. Patients for whom sequencing was not performed 
at the time of recurrence were excluded. Magnetic resonance imag- 
ing (MRI) tumour assessments were reviewed using the Response 
Assessment in Neuro-Oncology (RANO) criteria by three independent 
reviewers (M.J.L.-F., S.A., and R.Y.H.) who were blinded to the groups. 
PFS and OS duration were calculated from cycle 1 day 1 of PD-1 block- 
ade therapy. 
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Statistical analyses 

Data were summarized as frequencies and proportions for categori- 
cal variables and as median and range for continuous variables. 
Continuous variables were compared using Mann-Whitney or Kruskal- 
Wallis tests; categorical variables were compared using Fisher’s exact or 
Chi-squared tests. Survival and PFS were estimated using the Kaplan- 
Meier method, and differences in survival or PFS between groups were 
evaluated by the log-rank test. Survival for subjects who were alive or 
lost to follow-up at the time of last contact on or before data cut-off 
was censored at the date of the last contact. Patient matching ina k-to-k 
fashion was conducted using coarsened exact matching according to 
diagnosis, primary versus recurrent status, and prior treatments. For 
evaluation of response to PD-1 blockade, patients with glioma from 
the DFCI-Profile cohort who were treated with anti-PD(L)-lantibodies 
or other treatments (total n = 210) as part of their management were 
included in the analysis. For multivariable analysis, Cox proportional 
hazard regression was used to investigate the variables that affect sur- 
vival. P values were considered statistically significant when <0.05. 
Statistical analyses were performed using STATA (v14.2, StataCorp LLC, 
College Station, USA), Prism 8 (GraphPad Software, San Diego, USA), 
and MedCalc Statistical Software, version 19.0.3 (MedCalc Software 
bvba, Ostend, Belgium). For enrichment analyses, mutated genes were 
considered significant when Q< 0.01. Where applicable, the means of 
population averages from multiple independent experiments (+s.d. 
or S.e.m.) are indicated. No statistical methods were used to prede- 
termine sample size. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Clinical and sequencing data from 1,495 samples from the DFCI-Profile 
and 545 samples from the MSKCC-IMPACT datasets are publicly avail- 
able (GENIE v.6.1: https://genie.cbioportal.org or https://www.synapse. 
org/). All data for samples from the GENIE v.6.1 and TCGA pan-cancer 
datasets are publicly available. Data for samples from the FMI dataset 
are not publicly available, but de-identified, aggregated data can 
be accessed on request. dbGaP Study Accession: phs001967.v1.p1. 
All other data are available on request. 


Code Availability 


The code for the detection of microsatellite mutations in single-cell 
DNA sequencing is publicly available (https://github.com/parklab/ 
MSlIprofiler). 
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Extended Data Fig. 1| Overview of the clinical characteristics of the 545 and 8,121 gliomas with sequencing from the MSKCC-IMPACT and FMI 
patients in the study and analyses performed. a, Clinical datasets analysed datasets, respectively, and used them asa replication set (total set of 10,294 
and main demographics including age, histomolecular subtype and disease sequenced samples). In addition, 314 tumours—including 247 consecutive 
stage. 1,628 glioma samples from adult and paediatric patientsweresequenced _ recurrent gliomas—were analysed for protein expression of four MMR proteins 
as part of alarge institutional prospective sequencing program of consented (MSH2, MSH6, MLHI1, and PMS2) using immunohistochemistry. b, Analyses 


patients (DFCI-Profile) and subsequently clinically annotated. We identified performed and key clinical questions that were addressed in the study. 
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Extended Data Fig. 2| Distributions of TMB, homopolymer indels, and SNV 
mutation spectra in the datasets used. a, DFCI-Profile (de novo gliomas only); 
b, MSKCC-IMPACT; c, FMI (total n= 9,938). After examining the distribution of 
TMB in each dataset for breakpoints, thresholds for hypermutation were 
further confirmed using segmented linear regression analysis (analysis 
restricted to de novo gliomas for DFCI-Profile). This method showed the 
presence of a breakpoint at 17.0 and 8.7 mutations per Mb for the DFCI-Profile 
and FMI datasets, respectively. For the MSKCC-IMPACT dataset, the cutoff used 


for hypermutation (13.8 mutations per Mb) was previously determined”. 

The frequency of hypermutation was similar in the three datasets (85 (5.2%) in 
DFCI-Profile; 29 (5.3%) in MSKCC-IMPACT; 444 (5.5%) in FMI). The median tumour 
mutation burden (TMB) in the combined datasets was 2.6 mutations per Mb 
(range, 0.0-781.3). Compared with non-hypermutated gliomas, hypermutated 
tumours showed atypical patterns of SNVs, consistent with abnormal mutational 
processes operating in these samples. Bars represent median and interquartile 
range for each dataset (right). HPI, homopolymer indels. 
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Extended Data Fig. 3 | Integrated analysis of tumour mutation burdenin 
hypermutated gliomas in the DFCI-Profile, MSKCC-IMPACK and FMI 
datasets. a, Distribution of TMB, homopolymer indels, MMR mutations, and 
SNV mutational spectrum according to molecular status of JDH1/2, 1p/19q 
co-deletion (1p/19q), gain of chromosome 7 and/or deletion of chromosome 10 
(7gain/10del), and MGMT promoter methylation, histological grade, age at 
initial diagnosis, and history of prior treatment with alkylating agents or 
radiation therapy (the distinction between photon and proton therapy was not 
systematically captured) in the DFCI-Profile dataset (n= 84, data not shown for 
the single sample from other gliomas, /DH1/2-wt subgroup). b, Top, distribution 
of histomolecular groups in non-hypermutated and hypermutated gliomas 
from the combined sequencing dataset (n= 2,173). Bottom, distribution of 
molecular groups in de novo and post-treatment hypermutated gliomas from 
the DFCI-Profile dataset (n=85) (annotation not available for the 
MSKCC-IMPACT set). c, Prevalence of hypermutation according to MGMT 
promoter methylation and /DH1/2 mutation status in post-temozolomide 
gliomas from the DFCI-Profile dataset (n =150). Two-sided Fisher’s exact test. 
d, Number of temozolomide cycles according to /DH1/2 mutation statusin 
post-temozolomide diffuse gliomas from the DFCI-Profile dataset (n= 211 
gliomas). Patients who received combined chemoradiation but no adjuvant 
temozolomide were included. Two-sided Wilcoxon rank-sum test. e, Boxplots 
of TMB in post-treatment hypermutated gliomas according to the number of 


temozolomide cycles received before surgery. Kruskal-Wallis test and Dunn’s 
multiple comparison test. f, TMB in recurrent gliomas according to treatments 
received before surgery. Patients who received multiple treatment modalities 
were excluded. Kruskal-Wallis test and Dunn’s multiple comparison test. 
Boxes, quartiles; centre lines, median ratio for each group; whiskers, absolute 
range (d-f). g, Integrated analysis of the FMI dataset (n=8,121 gliomas) 
depicting tumour mutation burden, the number of indels at homopolymer 
regions, andthe SNV mutation spectrum detected in each tumour according to 
molecular status of /DH1/2 and 1p/19q co-deletion (1p/19q), MSI status, and age 
at initial diagnosis. Dominant mutational signatures detected in hypermutated 
samples are depicted. The dotted line indicates the threshold for samples with 
a high mutation burden (8.7 mutations per Mb). h, Prevalence of 
hypermutation among molecularly defined subgroups in the FMI dataset 
(n=8,121 gliomas). Chi-squared test. i, Dominant mutational signatures 
detected in hypermutated samples in the FMI dataset (n=8,121 gliomas). 
Chi-squared test.j, Mutated genes and pathways enriched in hypermutated 
gliomas inthe FMI dataset (n= 8,121). Enrichment was assessed using a 
permutation test to control for random effects of hypermutability in tumours 
with high TMB. k, I, Proportion of TMB"®" versus TMB’ samples with 
mutations in selected DNA repair genes and glioma drivers (e) and inthe MMR 
pathway (MSH2, MSH6, MLH1 and PMS2; f). Permutation test; ****P< 0.0001, 
***P< 0.001, **P< 0.01; ns, not significant. 
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Extended Data Fig. 4| See next page for caption. 


Extended Data Fig. 4| Validation of known hypermutation-associated 
signatures using TCGA datasets. Mutational signatures were predicted using 
exome-sequencing variants that overlapped with the panel-targeted regions, 
and then compared to previously published DeconstructSigs signature 
predictions based onall exome variants. The TCGA MC3 dataset was used to 
assess the detection of COSMIC mutational signatures associated with 
APOBEC (signatures 2 and 13), mismatch repair (signature 6), ultraviolet light 
(signature 7), POLE (signature 10), and tobacco (signature 4). Variant calls for 17 
hypermutated and 12 non-hypermutated glioma exome-sequenced samples* 
were used to assess temozolomide (signature 11) detection. a, Detection of 
APOBEC-associated mutational signature in TCGA BLCA samples (n=129 out 
of 411samples). b, Detection of ultraviolet-associated mutational signature in 
TCGA SKCM samples (n=237 out of 466 samples). c, Detection of tobacco 
smoking-associated mutational signature in TCGA LUAD samples (n= 250 out 
of 513 samples). d, Detection of MMR-associated mutational signature in TCGA 
COAD (n=188 out of 380 samples). e, Detection of POLE-associated mutational 
signature in TCGA COAD and READ samples (n= 277 out of 380 samples). 

f, Detection of temozolomide-associated mutational signature in ref. * 
(n=29).g, Unsupervised clustering of hypermutated samples. A total 


of 865 hypermutated tumour samples from exomes (pan-TCGA and Wang 

et al.*) and targeted panels (DFCI-Profile and MSK-IMPACT) were analysed for 
known hypermutation signatures (tobacco, UV, MMRD, POLE, TMZ, APOBEC). 
Samples and signatures underwent 2D hierarchical clustering based on 
Euclidean distance. h, Performance of cancer panel versus other genesetsin 
mutational signature calling. We analysed 622 hypermutated tumour exomes 
(pan-TCGA and Wang et al.*, black) for their mutational signature contributions 
when restricted to variants from i) DFCI-Profile OncoPanel cancer panel genes 
(red), or ii) 9 randomly selected gene sets (grey) of similar total capture size to 
the cancer panel. For eachsample, we assessed known hypermutation 
signatures for cancer panels and gene sets for which at least 20 single base 
substitutions were retained in the sample after restriction. Samples and 
signatures underwent 2D hierarchical clustering based on Euclidean distance. 
i, The violin plots represent the number of variants (top) and the cosine 
similarity of signature contributions (bottom) when using all exonic variants 
versus restriction to cancer panel or the 49 random gene sets. Boxes, quartiles; 
centre lines, median ratio for each group; whiskers, absolute range. Two-sided 
Welch’s ¢-test. 
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Extended Data Fig. 5 | See next page for caption. 
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Extended Data Fig. 5 | Mutational signature analysis of primary and 
secondary hypermutated cohort (n=111).a, Mutational signature analysis of 
newly diagnosed hypermutated gliomas in the DFCI-Profile dataset (n = 24). 

b, Mutational signature analysis of secondary hypermutated gliomas (samples 
in which hypermutation was detected in the recurrent tumour) inthe 
DFCI-Profile dataset (n=58). The novel COSMIC Signature 11-related signature 
(S2) was associated with 1p/19q co-deletion and lack of prior radiation therapy 
(66.7% of samples with high S2 versus 26.2% of samples with high S1 signature, 
Fisher P=0.016). c, Mutational signature analysis of hypermutated gliomas 
fromthe MSKCC-IMPACT dataset (n=29). d, Mutational signature analysis in 
de novo (hypermutated at first diagnosis, n=26, left) and post-treatment 
hypermutated gliomas (hypermutationina recurrent tumour, n=59, right). 
Percentage of samples exhibiting the most common mutational signatures 
and their hypothesized causes are displayed. MMR, C6, C14, C15, C26; 


age-related, Cl; POLE, C10, C14. Chi-squared test. e, Mutational signatures 
identified in individual de novo hypermutated gliomas (hypermutated at first 
diagnosis, n= 26, left) and post-treatment hypermutated gliomas 
(hypermutationinarecurrent tumour, n=59, right). f, Mutational signature 
analysis of MMR variants in hypermutated gliomas from the DFCI-Profile and 
MSKCC-IMPACT datasets (n=114). Ninety variants of the MMR genes MSH2, 
MSH6, MLH1, and PMS2were merged into two groups (de novo, n=18; 
post-treatment, n=72) according to the type of sample in which they were 
found and analysed for mutational signatures using a regression model 
(Rosenthal et al.°”). In each sample, only the MMR variant with the highest VAF 
was included, to limit the inclusion of possible passenger variants. For 
signature discovery in both cohorts (a-c), variants were analysed using the 
non-negative matrix factorization (NMF) method and correlated with known 
COSMIC mutational signatures" using Pearson correlation. 
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Extended Data Fig. 6| See next page for caption. 


Extended Data Fig. 6 | Characteristics of MMR molecular variants in 
hypermutated gliomas. a, b, Proportion of TMB"*" versus TMB samples with 
mutations in selected DNA repair genes and glioma drivers (a) and in the MMR 
pathway (MSH2, MSH6, MLH1 and PMS2) (b) in the merged DFCI-Profile/ 
MSKCC-IMPACT dataset (n = 2,173). Permutation test; ****P< 10°, *P<107, 
*P<0.05.c, CCFs of MMR gene mutations in post-treatment hypermutated 
gliomas versus other hypermutated cancers in the FMI dataset. Horizontal line, 
median. Two-sided Wilcoxon rank-sum test with Benjamini-Hochberg 
correction. d, VAF distribution of mutations in post-treatment hypermutated 
gliomas, non-glioma MMR-deficient cancers (diverse histologies) and other 
non-glioma hypermutated samples (diverse histologies) from the TCGA and 
MSKCC-IMPACT datasets. Each dot represents a mutation found in an individual 
sample (represented vertically). MMR mutations are depicted in red. Left, 


hypermutated samples from the pan-TCGA dataset; right, hypermutated 
samples from the MSKCC-IMPACT dataset. e, Integrated view of mutational 
signatures and MMR gene mutations and protein expression in hypermutated 
gliomas (n=114). Tumours with the mutational hotspot MSH6(T1219]) (11.9% of 
post-treatment hypermutated gliomas) are highlighted. f, Mutation diagram of 
MSH2, MSH6, MLH1, and PMS2 mutations found in hypermutated gliomas from 
the DFCI-Profile and MSKCC-IMPACT datasets (n=114). The hotspot MSH6 
missense variant p.T12191 was found in nine samples. g, Hotspot MSH6 p.112191 
variant mapped to the bacterial MutS 3D structure (PDB 5YK4). h, Representative 
immunohistochemistry (IHC) images of the MMR proteins MSH2, MSH6, MLH1 
and PMS2 ina hypermutated glioblastoma with MSH6(T1219I) mutation. Three 
independent samples were stained. Scale bar, 100 tm. 
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Extended Data Fig. 7 | See next page for caption. 


Extended Data Fig. 7 | Results of MMRIHC screening in 213 consecutive 
recurrent gliomas and patterns of MMR protein expression loss in three 
de novo or post-treatment MMR-deficient gliomas. a, Recurrent patterns of 
MMR protein loss identified by IHC in gliomas. Scale bar, 50 um. b, Summary of 
MMR IHCscreening results for 213 consecutive recurrent gliomas. All 
monocentric consecutive relapses of diffuse gliomas in adult patients 
following treatment with post-alkylating agents (surgery between 2009 and 
2015) were included in the immunohistochemistry analysis. Further 
sequencing of samples in which MMR protein loss was identified showed 
hypermutation with MMR molecular defects in 18/19 (94.7%) samples. 

c, Percentage of tumour MMR protein loss in glioma samples with de novo 
(n=16) or post-treatment (n=46) MMR deficiency. Samples were scored by two 
pathologists in blinded fashion. Regional heterogeneity of MMR protein loss 
for the four MMR proteins MSH2, MSH6, MLH1, and PMS2 was scored as to the 
maximal percentage of protein loss among tumour cells for each sample (5% 
increments). Boxes, quartiles; centre lines, median ratio for each group; 
whiskers, absolute range, excluding outliers. Two-sided Wilcoxon rank-sum 
test.d, Clonal MMR deficiency ina de novo high-grade glioma. Top left, low 
magnification of haematoxylin and eosin (H&E) staining of the large surgical 
tumour pieces obtained from surgical resection. Right, high magnification in 
three tumour areas (H&E staining, MLH1 and PMS2 immunostaining) showing a 
highly cellular tumour with an oligodendroglial phenotype and aloss of 
expression of MLH1and PMS2 in all tumour cells (open arrowheads). Normal 
cells have a maintained MLH1 and PMS2 expression (solid arrowheads). Bottom 
left, microsatellite testing via PCR amplification of five mononucleotide 
markers (BAT25, BAT26, NR21, NR24, and MONO27) showed the tumour to be 
MSS. Array CGH showed a homozygous deletion of the entire coding region of 
MLHIL. Scale bars; top left, 5 mm; right, 100 pm. e, Clonal MMR deficiencyina 
hypermutated post-treatment, /DH1-mutant glioblastoma. Top left, 


low-magnification image of H&E staining of tissue obtained from surgical 
resection, with three areas of tumour selected for images. Red dashed line 
delimits normal brain. Right, high-magnification images of H&E staining, 
showing highly cellular tumour and an astrocytic phenotype, and PMS2 IHC, 
showing loss of expression of PMS2 in all tumour cells (open arrowheads). 
Normal cells have maintained PMS2 expression (internal control, solid 
arrowheads). Bottom left, Microsatellite testing via PCR amplification of five 
mononucleotide markers (BAT25, BAT26, NR21, NR24, and MONO27) showed 
the tumour to be MSS. NGS showed a TMB of 120.1 per Mb and homopolymer 
indel burden of 3.8 per MB, with contributions from temozolomide (90%) and 
MMR-deficiency (10%) mutational signatures. A missense (p.P648L) hotspot 
MLHI1 mutation known to be pathogenic from patients with Lynch syndrome 
witha VAF of 0.73 and loss of heterozygosity was present in this case. Scale 
bars, top left, 5mm; right 100 pm. f, Subclonal MMR deficiency ina 
hypermutated post-treatment /DH1-mutant glioblastoma. Top left, 
low-magnification image of PMS2immunostaining of the tumour pieces 
obtained from surgical resection. Right, high magnification images of three 
areas of PMS2 immunostaining showing heterogeneous PMS2 expression 
across the sample consistent with a subclonal tumour. Area 1 shows that PMS2 
is retained in atypical tumour cells (arrow); area 2 is heterogeneous with loss 
(open arrowhead) insome but not all tumour cells; area 3 is an example of 
diffuse loss of expression in tumour cells (open arrowhead). Normal cells have 
a maintained PMS2 expression (solid arrowheads in all images). Bottom left, 
microsatellite analysis via PCR amplification of five mononucleotide markers 
(BAT25, BAT26, NR21, NR24, and MONO27) showed the tumour to be MSS. NGS 
showed a TMB of 236.5 per Mb and homopolymer indel burden of 2.3 per MB, 
with 95% contribution of temozolomide mutational signature. Scale bars, top 
left 5mm; right 100 pm. 
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Extended Data Fig. 8 | Characterization of high-grade glioma PDCLs and 
their sensitivity to temozolomide and CCNU. a, Clinico-molecular 
characteristics of four native newly diagnosed or recurrent glioma PDCL 
models harbouring hypermutation and MMR deficiency. b, Thirty glioma 
PDCLs, including four PDCLs derived from patients with de novo (BT1160, 
N16-1162, both established from patients with Lynch syndrome) or 
post-treatment (BT237, BT559) MMR deficiency were molecularly 
characterized using whole-exome sequencing. The panels show the tumour 
mutational burden (left) and homopolymer indel burden (right) ineach model. 
Boxes, quartiles; centre lines, median ratio for each group; whiskers, absolute 
range. Two-sided Wilcoxon rank-sum test. c, Mutational signature analysis was 
performed inthe PDCL models of constitutional and post-treatment MMR 
deficiency using the R package DeconstructSigs to estimate the contributions 
of mutational signatures using aregression model (Rosenthalet al.*”). For each 
PDCL, the contribution of the main COSMIC mutational signatures identified is 


expressed as decimal. d, Boxplots of temozolomide AUC in non-hypermutated 
versus hypermutated PDCLs. Boxes, quartiles; centre lines, median ratio for 
each group; whiskers, absolute range. Two-sided Wilcoxon rank-sum test. 

e, f, A panel of 12 glioma PDCL models representing the different MGMT and 
MMR classes was selected and assessed for sensitivity to temozolomideina 
short-term viability assay (e; dots represent means). The temozolomide AUC 
was compared between the three groups using a Kruskal-Wallis test and Dunn’s 
multiple comparison test (f; mean+s.d.). g, Western blot of the glioblastoma 
patient-derived cell line (BT145) in which the genes MSH2, MSH6, MLHI1 or PMS2 
have been knocked-out using the CRISPR-Cas9 system. h, i, A panel of 11glioma 
PDCL models representing the different MGMT and MMR classes was selected 
and assessed for sensitivity to CCNU ina short-term viability assay (h; dots 
represent means). No CCNU data was available for the model BT172. The CCNU 
AUC was compared between the three groups using a Kruskal-Wallis test and 
Dunn’s multiple comparison test (i; mean+s.d.). 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 | MMR-deficient models of glioma, continued. 

a, b, CRISPR-Cas9 MSH2 and MSH6 gene knockout in DIPG13 high-grade glioma 
cellline. a, Integrated genomics viewer (IGV) plots depicting MSH2 reads in 
between the guide RNAs in the MSH2 unedited line (sgGFP, left) and the MSH2 
CRISPR knockout line (right) confirming knockout in the MSH2 edited line. 

b, IGV plots depicting MSH6 reads in between the guide RNAs in the MSH6 
unedited line (sgGFP, left) and the MSH6 CRISPR knockout line (right) 
confirming knockout in the MSH6 edited line. c, Overview of in vivo 
temozolomide resistance study. Treatment of subcutaneous BT145 
PDX-bearing animals was initiated at a volume of 1OO mm? and eight nude mice 
per group were randomized to 12 mg/kg/day temozolomide or vehicle for five 
consecutive days per 28-day cycle. Mice were treated until tumours reacheda 
volume of 1,500 mm’, and tumours were sequenced to identify mutations and 
mutational signature. d, Survival of mice with BT145 xenografts (n=8 mice per 
group) during treatment with vehicle (blue) or temozolomide (red). Two-sided 
log-rank test. e, Unique variants found in three sequenced BT145 tumours 
(two temozolomide-treated, and one vehicle-treated) were analysed for 


correlation with known mutational signatures. COSMIC Signature 11 was found 
inthe two temozolomide-treated tumours. Mutational signatures could not be 
called in the vehicle-treated tumour (too few variants). After filtering of truncal 
variants common toall tumours, the two temozolomide-treated tumours 
shared only four variants, including an MSH6(T12191) mutation and three 
noncoding variants. f, BT145 xenografts chronically treated with vehicle (n =1) 
or temozolomide (n= 2) were removed, dissociated and cultured in serum-free 
medium to establish cell lines. After three passages in culture, sensitivity to 
temozolomide was assessed. The results of the short-term viability assays 
(mean +s.e.m.) and temozolomide AUC of each cell line are depicted. g, Model 
of acquired hypermutation with mutational signature 11 in gliomas. Top, 
MMER-proficient cells repair TMZ damage and do not develop signature 11. 
Resistance in these cells is mediated by non-MMR pathways (for example, 
MGMT expression). Bottom, TMZinduces and/or selects resistant subclonal 
MMR-deficient cells. Further TMZ exposure produces accumulation of 
mutations at specific trinucleotide contexts, detected as hypermutation 

with signature 11. 
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Extended Data Fig. 10| See next page for caption. 


Extended Data Fig. 10| Extended outcome data. a-c, Survival of patients 
with recurrent high-grade glioma (WHO grade Ill or IV) from the time of initial 
diagnosis according to TMB status (solid curves, TMB'™; dotted curves, 
TMB"®"), The curves include 240 recurrent samples from DFCI-Profile 

with available survival data from initial diagnosis. Two-sided log-rank test. 

a, Survival of patients with recurrent high-grade 1p/19q co-deleted 
oligodendroglioma from the time of initial diagnosis. b, Survival of patients 
with recurrent high-grade /DH1/2-mutant astrocytoma from the time of initial 
diagnosis. c, Survival of patients with recurrent /DH1/2 wild-type glioblastoma 
fromthe time of initial diagnosis. d, PFS of 11 patients with hypermutated and 
MMR-deficient glioma who were treated with PD-1 blockade (single-agent or in 
combination with bevacizumab, red curve). A cohort of patients with 
non-hypermutated glioma who were treated with PD-1 blockade is depicted as 
control (n=10, best matches according to diagnosis, primary versus recurrent 
status, and prior treatments, blue curve). A two-sided log-rank testis used. 


e, f, PFS (e) and OS (f) of 11 patients with hypermutated and MMR-deficient 
glioma who were treated with PD-1 blockade (red curves). A cohort of 
hypermutated patients treated with other systemic agents is depicted as 
control (best matches according to diagnosis, primary vs recurrent status, and 
prior treatments were selected from the cohort of sequenced gliomas, purple 
curves). Two-sided log-rank test. Clinical and histomolecular characteristics of 
patients from both cohorts are provided in Supplementary Table 7. g, Lack of 
immune response following PD1 blockade (pembrolizumab) ina patient 

with post-treatment hypermutated MMR-deficient glioblastoma. Top, 
timeline; middle, MRI images; bottom, H&E images and IHC for PMS2 
expression and tumour infiltration with CD3-positive T cells and IBA1-positive 
macrophages in the primary (S1), recurrent pre-pembrolizumab (S3) and 
recurrent post-pembrolizumab (S4) tumours. The tumour acquired a focal 
PMS2two-copy deletion, protein loss, and hypermutation in the 
post-temozolomide recurrent tumour (S3). Scale bar, 50 pm. 
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Extended Data Fig. 11|See next page for caption. 


Extended Data Fig. 11| Molecular characteristics of hypermutated 
gliomas. a, Pan-cancer analysis of TMB and homopolymer indel burdenin 

the GENIE dataset (n=44,389). Tumour samples from the GENIE dataset (v6.1) 
were analysed for mutational and homopolymer indel burden. Statistical 
comparisons between groups are provided in Supplementary Table 6. b, TMB 
in hypermutated gliomas (post-treatment) versus MMR-deficient cancers and 
other hypermutated cancers from the TCGA and Wang et al.* exome datasets 
(n=798). Two-sided Wilcoxon rank-sum test with Bonferroni correction. 

c, Pan-cancer analysis of the homopolymer indel burdenin hypermutated 
gliomas (post-treatment) versus MMR-deficient cancers and other 
hypermutated cancers from the TCGA and Wang et al.* exome datasets 
(n=798).d, Results of MSI analysis using the standard pentaplex assay in 
glioma (n= 39) and CRC samples (n=19) according to MMR status (MMR-d, 
MMR deficient; MMR-p, MMR-proficient). e, Pan-cancer analysis of cancer cell 
fractions inhypermutated gliomas (post-treatment) versus MMR-deficient 
cancers and other hypermutated cancers from the TCGA and Wang et al.* 
exome datasets (n=798). Two-sided Wilcoxon rank-sum test with Bonferroni 
correction. f, Weighted TMB in hypermutated gliomas (post-treatment) versus 
MMER-deficient cancers and other hypermutated cancers fromthe TCGA and 
Wang etal.* exome datasets (n = 798). The weighted TMB was calculated by 
weighing each individual mutation to its cancer cell fraction. Two-sided 
Wilcoxon rank-sum test with Bonferroni correction. g, Distribution of VAFs 
(left) and mutation spectrum analysis of low-allelic frequency variants (<O.1, 
right) in TMB gliomas (n=1,543, top), de novo hypermutated gliomas with 
MMR deficiency mutational signature (n =12, middle), and post-treatment 
hypermutated gliomas (n= 59, bottom) from the DFCI-Profile dataset. 

h, Distribution of VAFs (left) and mutation signature analysis of low-allelic 
frequency variants (<0.1, right) in TMB’ CRCs (n=1,265, top) and TMB"®" CRCs 
with MMR deficiency mutational signature (n = 110, bottom) from the GENIE 


dataset. i, Clinical timeline for the patient with hypermutated glioblastoma 
with an MSH6(T12191) mutation in whom bulk and single-cell WGS was 
performed. j, Distribution of VAFs of mutations inthe recurrent bulk sample. 
The median VAF in the recurrent sample was 0.11. The MSH6(T12191) mutation 
had the 18""-highest VAF out of 4,350 coding mutations. k, Cancer cell fractions 
(CCFs) of mutations inthe primary and recurrent tumour bulk samples. Each 
dot represents a coding mutation. The horizontal and vertical axes are 
estimated clonal frequency for each mutation in the primary and recurrent 
samples, respectively. Mutations of the four main MMR genes are depicted 
inred.I, Mutational spectra in 35 cells from the primary tumour (orange) 

and 28 fromthe recurrent tumour (green) submitted to scWGS sequencing 
(1x). Mutational signature analysis showed a strong contribution of 

mutational signature 11in hypermutated cells from the recurrent tumour. 

m, Representative IGV plots (n=2 distinct genomic segment for each sample) 
of microsatellite insertions in the normal (TMB low) and recurrent (TMB 

high) bulk samples and recurrent TMB low (n=2) and TMB high (n=2) single 
cells. Solid arrowheads represent microsatellite insertions phased witha 
flanking heterozygous SNP allele. Open arrowheads represent microsatellite 
insertions for which the reads do not reach the flanking heterozygous SNP 
allele. Both hypermutated single cells showed multiple phased microsatellite 
insertions consistent witha true somatic microsatellite mutation. In general, a 
few reads with similar microsatellite insertions correctly phased with the same 
flanking heterozygous SNP allele were found inthe recurrent bulk, but not in 
the normal bulk or non-hypermutated cells. For ac, e, f, biological subgroups 
were identified on the basis of mutational burden, dominant signature and 
histology. For b, c, e, f, 100 non-hypermutated samples were randomly selected 
as controls. For all box plots: boxes, quartiles; centre lines, median ratio for 
each group; whiskers, absolute range, excluding outliers. RT, radiation therapy; 
Cil, cilengitide; Cabo, cabozantinib; Bev, bevacizumab. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


m4 The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


— For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


[| For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection No software used for data collection 


Data analysis For WES analyses, we used the CGA WES Characterization pipeline developed at the Broad Institute to call, filter and annotate somatic 
mutations and copy number variation. The pipeline employs the following tools: MuTect[1], ContEst[2], Strelka[3], Orientation Bias 
Filter[4], DeTiN [5], AllelicCapSeg[6], MAFPoNFilter[7], RealignmentFilter, ABSOLUTE[8], GATK[9], PicardTools[10], Variant Effect Predictor 
[11], Oncotator [12]. 

1. MuTect1: Cibulskis, K, Lawrence, MS & Carter, SL. Sensitive detection of somatic point mutations in impure and heterogeneous cancer 
samples. Nature ... (2013). doi:10.1038/nbt.2514 

2. ContEst: Cibulskis, K. et al. ContEst: estimating cross-contamination of human samples in next-generation sequencing 
data.Bioinformatics 27, 2601-2 (2011). 

3. Strelka: Saunders, C. T. et al. Strelka: accurate somatic small-variant calling from sequenced tumor-normal sample pairs. Bioinformatics 
28, 1811-7 (2012). 

4. Orientation Bias Filter (oxoG, FFPE): Costello, M. et al. Discovery and characterization of artifactual mutations in deep coverage 
targeted capture sequencing data due to oxidative DNA damage during sample preparation. Nucleic Acids Res. 41, e67 (2013). 

5. DeTiN: Taylor-Weiner, A. et al. DeTIN: overcoming tumor-in-normal contamination. Nat Methods 15, 531-534 (2018). 

6. AllelicCapSeg: Landau, D. A. et al. Evolution and impact of subclonal mutations in chronic lymphocytic leukemia. Cell 152, 714-26 
(2013). 

7. MAFPoNFilter: Lawrence, M. et al. Discovery and saturation analysis of cancer genes across 21 tumour types. Nature 505, 495 (2014). 
8. ABSOLUTE: Carter, S. L. et al. Absolute quantification of somatic DNA alterations in human cancer. Nat. Biotechnol. 30, 413-21 (2012). 
doi: 10.1038/nbt.2203. 

9. GATK (Mutect2, somatic CNV): McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next- 
generation DNA sequencing data.Genome Res. 20, 1297-303 (2010). 

10. Picard Tools: https://software.broadinstitute.org/gatk/documentation/tooldocs/4.0.1.0/ 
picard_fingerprint_CrosscheckFingerprints.php 
https://software.broadinstitute.org/gatk/documentation/tooldocs/4.0.0.0/picard_analysis_CollectMultipleMetrics.php 
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11. Variant Effect Predictor: McLaren, W. et al. The Ensembl Variant Effect Predictor.Genome Biol. 17, 122 (2016). 
12. Oncotator: Ramos, A. H. et al. Oncotator: cancer variant annotation tool.Hum. Mutat. 36, E2423—9 (2015). 


Additional tools used for PDX analyses, mutational signature analyses and statistical analyses included: 

1. SAMtools (1.7): Li, H. et al A statistical framework for SNP calling, mutation discovery, association mapping and population genetical 
parameter estimation from sequencing data. Bioinformatics 2011;27:2987-93 

2. BWA-MEM (0.7.17): https://github.com/lh3/bwa 

. Disambiguate (ngs_disambiguate-1.0): https://github.com/AstraZeneca-NGS/disambiguate 

. Genemapper Software (5): https://www.thermofisher.com/order/catalog/product/4475073#/4475073 
. SomaticSignature (3.1): https://bioconductor.org/packages/release/bioc/html/SomaticSignatures.html 
. DeconstructSigs (1.8): https://github.com/raerose01/deconstructSigs 

. STATA (v14.2): https://www.stata.com 

edCalc (19.0.3): https://www.medcalc.org 

. Graphpad Prism (8): https://www.graphpad.com/scientific-software/prism/ 


OMNDURW 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


Clinical and sequencing data from 1495 samples from the DFCI-Profile and 545 samples from the MSKCC-IMPACT datasets are publicly available (GENIE v4.1: 
https://www.synapse.org/). All data for samples from the GENIE v6.1 and TCGA pan-cancer datasets are publicly available. Data for samples from the FMI dataset 
are not publicly available. 

Detailed clinical annotation for the DFCI-Profile and MSKCC-IMPACT cohorts is provided in supplementary table 1. 
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x Life sciences Behavioural & social sciences [| Ecological, evolutionary & environmental sciences 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No statistical test was used to determine the sample size. We systematically collected data from 10,294 glioma samples from three large 
independent datasets. Based on prior literature (prevalence of hypermuation of 2-5% in gliomas), we hypothesized that this sample size 
would provide enough power (200-500 hypermutated samples) for clinical and molecular association studies. 


Data exclusions 47 samples for which the clinical diagnosis of glioma could not be confirmed (other histology or possible non-tumor sample) and 5 samples 
with missing clinical annotation were excluded from all analyses. Exclusion criteria were pre-established. 


Replication In vivo experiments were performed using groups of 8 mice per group. All mice were included in the analysis. 
In vitro sensitivity assays were replicated in at least 3 independent experiments. All experiments that were technically valid were included in 


the analysis. 


Randomization For in vivo experiments, mice were randomized. 


Blinding No blinding was performed. Blinding was not relevant to our study. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 
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Materials & experimental systems Methods 


n/a | Involved in the study n/a | Involved in the study 

L_| Antibodies [|_| ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used For immunohistochemistry, the following antibodies were used: mouse monoclonal anti-ATRX (Bio SB, clone BSB-108, BSB3296, 
1:100), mouse monoclonal anti-IDH1 R132H (Dianova, clone HO9, DIA-HO9 , 1:100), rabbit monoclonal anti-CD3 (Roche, clone 
2GV6, 790-4341, prediluted), rabbit polyclonal anti-IBA1 (Wako, W1W019-19741, 1:500), mouse monoclonal anti-MLH1 (Roche, 
clone M1, 790-4535, prediluted), mouse monoclonal anti-MSH2 (Roche, clone G219-1129, 760-4265, prediluted), mouse 
monoclonal anti-MSH6 (Roche, clone 44, 760-4455, prediluted), rabbit monoclonal anti-PMS2 (Roche, clone EPR3947, 760-4531, 
prediluted). For immunohistochemistry performed at BWH, the following antibodies were used: mouse monoclonal anti-MLH1 
(Leica, clone ESO5, MLH1-L-CE, 1:75), mouse monoclonal anti-MSH2 (Merck Millipore, clone Ab-2-FE11, NA27, 1:200), mouse 
monoclonal anti-MSH6 (Leica, clone PU29, MSH6-L-CE, 1:50), mouse monoclonal anti-PMS2 (Cell Marque, MRQ-28, 288M-14- 
ASR, 1:100). For immunobloting, the following antibodies were used: mouse monoclonal anti-MGMT (Millipore, MT3.1, 
MAB16200, 1:500), mouse monoclonal anti- MSH2 (Calbiochem, FE11, NA27, 1:1000), mouse monoclonal anti-MSH6 
(Biosciences, 44, 610918, 1:500), mouse monoclonal anti-MLH1 (Cell Signaling, 4C9C7, 3515, 1:500), mouse monoclonal anti- 
PMS2 (BD Biosciences, A16-4, 556415, 1:1000), mouse monoclonal anti-beta-actin (Sigma, AC-74, A2228, 1:10000). 
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Validation All antibodies used are commercially available and were validated by the manufacturer. 


Eukaryotic cell lines 


Policy information about cell lines 
Cell line source(s) All patient-derived cell lines (PDCLs) and xenografts (PDXs) with a name starting with "BT" were established from tumors 
resected at Brigham and Women's Hospital and Boston Children's Hospital (Boston, MA). SU-DIPG-XIII (DIPG13) cells were 
provided by Dr. Michelle Monje at Stanford University. 
Authentication The identity of all cell lines established were confirmed by short tandem repeat assay. 


Mycoplasma contamination All cell lines were tested for the absence of mycoplasma. 


Commonly misidentified lines No commonly misidentified cell lines were used. 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Eight-week-old NU/NU male mice (Charles River). 

Wild animals The study did not involve wild animals. 

Field-collected samples The study did not involve samples collected from the field. 

Ethics oversight All in vivo studies were performed in accordance with Dana-Farber Cancer Institute animal facility regulations and policies under 


the protocol #09-016. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics We systematically analyzed clinical data and somatic tumor variants identified through targeted next-generation sequencing 
(NGS) panels of 1,628 gliomas sequenced between June 2013 and November 2018 as part of a large institutional prospective 
profiling program (DFCI-Profile). All samples were assigned to a molecular subgroup based on histopathology, mutational status 
of IDH1 and IDH2 genes, and whole-arm co-deletion of chromosomes 1p and 19q (1p/19q co-deletion). A total of 545 
independent samples from the GENIE project (a repository of genomic data obtained during routine clinical care at international 
institutions) were also identified and assigned to molecular subgroups. The combined sequencing set comprised 2,173 glioma 
samples, which included a broad spectrum of newly-diagnosed and recurrent glioma types including IDH1/2 wild-type 


glioblastomas (1234 tumors, 56.8%), IDH1/2-mutant gliomas (640, 29.5%), and other rare subtypes of IDH1/2 wild-type gliomas 
(299, 13.8%). In addition, 247 gliomas collected at one site between 2009 and 2017 were analyzed for targeted protein 
expression using immunohistochemistry. 


Recruitment Genomic testing was ordered by the pathologist or treating physician as part of routine clinical care to identify relevant genomic 
alterations that could potentially inform diagnosis and treatment decisions. All patients undergoing genomic testing signed a 
clinical consent form, permitting to return results from clinical sequencing. No systematic bias likely to impact results were 
known at the time of data analysis. 


Ethics oversight The study, including the consent procedure, was approved by the institutional ethics committees (10-417/11-104/17-000; WIRB, 
Puyallup WA). 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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The initiation of an intestinal tumour is a probabilistic process that depends on the 
competition between mutant and normal epithelial stem cells in crypts’. Intestinal 
stem cells are closely associated with a diverse but poorly characterized network of 
mesenchymal cell types”*. However, whether the physiological mesenchymal 
microenvironment of mutant stem cells affects tumour initiation remains unknown. 
Here we provide in vivo evidence that the mesenchymal niche controls tumour 
initiation in trans. By characterizing the heterogeneity of the intestinal mesenchyme 
using single-cell RNA-sequencing analysis, we identified a population of rare 
pericryptal Ptgs2-expressing fibroblasts that constitutively process arachidonic 
acid into highly labile prostaglandin E, (PGE,). Specific ablation of Ptgs2in fibroblasts 
was sufficient to prevent tumour initiation in two different models of sporadic, 
autochthonous tumorigenesis. Mechanistically, single-cell RNA-sequencing analyses 
of amesenchymal niche model showed that fibroblast-derived PGE, drives the 
expansion of a population of Sca-1* reserve-like stem cells. These express a strong 
regenerative/tumorigenic program, driven by the Hippo pathway effector Yap. 

In vivo, Yap is indispensable for Sca-1* cell expansion and early tumour initiation and 
displays a nuclear localization in both mouse and human adenomas. Using organoid 
experiments, we identified a molecular mechanism whereby PGE, promotes Yap 
dephosphorylation, nuclear translocation and transcriptional activity by signalling 
through the receptor Ptger4. Epithelial-specific ablation of Ptger4 misdirected the 
regenerative reprogramming of stem cells and prevented Sca-1" cell expansion and 
sporadic tumour initiation in mutant mice, thereby demonstrating the robust 
paracrine control of tumour-initiating stem cells by PGE,—Ptger4. Analyses of 
patient-derived organoids established that PGE,-PTGER4 also regulates stem-cell 
function in humans. Our study demonstrates that initiation of colorectal cancer is 
orchestrated by the mesenchymal niche and reveals a mechanism by which rare 
pericryptal Ptgs2-expressing fibroblasts exert paracrine control over tumour- 
initiating stem cells via the druggable PGE,-Ptger4—Yap signalling axis. 


Mesenchymal cells are localized in tight association with stem cells single-cell RNA-sequencing (RNA-seq) analysis in the mouse intesti- 
in crypts, separated from them by a layer of extracellular matrix less nal mesenchyme. By sequencing 3,179 non-epithelial, non-immune 
than a1pm in thickness (Extended Data Fig. 1a, b). Toinvestigatethe —_ cells weidentified all major mesenchymal cell types of the lamina pro- 
heterogeneity of intestinal mesenchymal cells and identify specific — pria, including vascular and lymphatic endothelial cells, pericytes, 
pathways that could control stem-cell dynamics, we performed smoothmusclecells and glial cells (Fig. 1a). Our analyses also revealed 
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Fig. 1| Single-cell analyses of the intestinal mesenchyme reveal arare 
fibroblast population that expresses Ptgs2 and its protein product Cox-2, 
located under the crypts. a—c, Single-cell RNA-seq of 3,179 mesenchymal cells 
from the normal mouse colon. a, t-distributed stochastic neighbour 
embedding (t-SNE) plot with clustering results. F, fibroblasts. MF, 
myofibroblasts. b, Violin plots showing the entire range of mesenchymal 
marker gene expression levels per single cell in each cluster. Endoth, 


the existence of four different fibroblast populations—which we des- 
ignate F1 to F4—all characterized by expression of Pdgfra (Fig. 1b); F1 
and F2 cells are Pdgfra’, whereas F3 and F4 cells are Pdgfra"®" (Fig. 1b). 
A similar division of fibroblasts into PDGFRA high and low popula- 
tions was found ina single-cell dataset? of the human colonic mesen- 
chyme (Extended Data Fig. 1h). Confocal and two-photon imaging in 
Pdgfra®"*-knockin mice* confirmed the presence of Pdgfra™®" and 
Pdgfra' fibroblasts and revealed a distinct localization of these two 
populations in the adult intestine (Extended Data Fig. 2a, b). A dichoto- 
mous and compartmentalized expression of Pdgfra was observed in 
the intestinal mesenchyme throughout postnatal development and on 
embryonic day 15, when Pdgfra"" cells are associated with early villus 
formation, whereas Pdgfra'™ cells occupy the rest of the mesenchyme 
(Extended Data Fig. 2c). Among Pdgfra'™ cells, F1 fibroblasts comprise 
two subsets (Fla and Flb), both marked by expression of Cd34 and Has1 
(Fig. 1b). Population F2 expresses Cd34 and Fgfr2 and comprises four 
subsets (F2a-F2d) occupying diverse niches in the intestine, as shown 
in Fgfr2™°Y-knockin mice? (Fig. 1b, Extended Data Fig. 2d). Among 
Pdgfra"®" cells, F3 cells express the Cajal-cell marker Ano2 whereas 
F4 cells express Acta2, consistent with a myofibroblast phenotype 
(Fig. 1b). To understand the potential functions of these uncharacter- 
ized populations we performed pathway analyses. We observed a robust 
enrichment of arachidonic acid metabolism genes in F3 (Cajal) cells and 
inthe small population of F2d fibroblasts (Extended Data Fig. 1g, Sup- 
plementary Table 1), a pathway strongly associated with colorectal can- 
cer®. Arachidonic acid is processed by cyclooxygenases to prostanoids, 
highly bioactive lipid mediators witha very short half-life and an auto- 
crine or paracrine function in tissues®. In humans, pharmacological 
inhibition of cyclooxygenase-2 (Cox-2, also known as Ptgs2) prevents 
both hereditary and sporadic forms of colorectal cancer through an 
unknown cellular mechanism, but adverse cardiovascular effects cur- 
rently impede its clinical application’. By fractionating normal human 
colonic tissues we found that expression of the Cox-2-encoding gene 
PTGS2 is nearly undetectable in the epithelium but occurs predomi- 
nantly in stromal cells; the same pattern as observed in the mouse 
intestine (Extended Data Fig. Ic, d). Our single-cell analyses showed that 
inthe steady-state, mouse intestine Ptgs2is predominantly expressed 
in F3 (Cajal) cells and in the Pdgfra’ Fgfr2' Vcam1"®" F2d fibroblasts 
(cluster 13) (Fig. 1c, Extended Data Fig. 1f). In humans, P7GS2is mainly 
expressed in PDGFRA'™ FGFR2'VCAMI fibroblasts (cluster 8) and, toa 
lesser extent, in other fibroblast populations (Extended Data Fig. 1h). 


endothelial; lymph, lymphatic; pericyt, pericytes; SMC, smooth muscle cells. 
c, Ptgs2 expression levels per single cell visualized by t-SNE plot. 

d, Immunostaining for Cox-2 (green) and vimentin (white) in the normal mouse 
ileum and colon. Cox-2-expressing fibroblasts are located under the crypt (C) 
epithelium (white arrowheads) and in the muscularis propria (MP). Results are 
representative of three independent experiments. Scale bars, 20 pm. 


Immunostaining of Cox-2 protein in the mouse intestine verified the 
presence of a Cox-2-expressing fibroblast population in the muscular 
layer, a location consistent with that of Cajal cells, and a second rare 
Cox-2-expressing fibroblast population located around part of the 
crypts, in close proximity to the stem-cell zone, suggestive of the F2d 
fibroblast cluster (Fig. 1d). We named these cells rare pericryptal Ptgs2- 
expressing fibroblasts (RPPFs). In agreement with their pericryptal 
location, RPPFs are marked by expression of the laminin subunit Al 
(encoded by Lama1),abasement membrane protein detected specifi- 
cally at the mesenchymal-epithelial interface, and also by expression 
of R-spondin 1 (encoded by Rspo1), a stem-cell niche factor detected 
mainly at the crypt base (Fig. 1b, Extended Data Fig. 2e, f). 

Given the localization of RPPFs within the stem-cell microenviron- 
ment, we aimed to understand their role as a potential mesenchymal 
niche of tumour-initiating stem cells. We used fibroblast-specific 
Col6-cre mice, which target a substantial fraction of Pdgfra* intestinal 
fibroblasts, including fibroblasts surrounding the crypts and Cox- 
2-expressing fibroblasts, as shown by lineage tracing, flow cytometry 
in reporter mice and quantitative PCR with reverse transcription 
(RT-qPCR) analyses in Col6-Cre* cells separated by fluorescence-acti- 
vated cell sorting (FACS) (Extended Data Fig. 3a-c). Specific ablation 
of Cox-2in Col6-cre’ fibroblasts in Col6-cre-Ptgs2” (Ptgs2“"*") mice was 
efficient and led to a significant reduction of Ptgs2 expression lev- 
els in the whole tissue (Extended Data Fig. 3d, e), thereby confirming 
that fibroblasts are a predominant source of Cox-2 in the steady-state 
intestine. 

Toaddress the role of RPPFs inthe mesenchymal niche in early tumour 
initiation, we used the Apc“"* mouse model in which autochthonous 
intestinal tumours are spontaneously formed by stem cells losing 
heterozygosity*. This model is highly relevant to human cancer, since 
somatic or germline mutations in the APC gene, anegative regulator of 
Wnt--catenin signalling, drive sporadic or hereditary forms, respec- 
tively, of intestinal neoplasia. Intestinal stem cell-specific Apc ablation 
is sufficient to drive tumorigenesis®. Notably, although adenoma forma- 
tionin Apc-mutant mice is known to be Cox-2-dependent, the pathway 
by which this is mediated remains unknown’. Thus, we generated 
Apc™"/* Ptgs2*"i* mice and studied adenoma formation. We found that 
specific ablation of Ptgs2in fibroblasts led to a strong reduction inthe 
number of microadenomas formed in the small intestine at the early 
stage of 5 weeks (Fig. 2a) and significantly fewer macroscopic tumours 
formed at 5.5 months (Fig. 2b), along with a milder splenomegaly anda 
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Fig. 2 | Ptgs2-expressing fibroblasts drive tumour initiation by secreting 
PGE, inthe crypt microenvironment. a, Number of B-catenin* 
microadenomas in two sections of the small intestine (SI) of 5-week-old 
Apc™"Ptgs2" (n= 6) and Apc™"*Ptgs2“"""' (n= 6) mice. Two-tailed t-test. 

b, Number of macroscopic adenomas in5.5-month-old Apce™™*Ptgs2” (n=16) 
and Apc™""*Ptgs2“"> (n = 23) mice. Two-tailed Mann-Whitney test. c, Number of 
macroscopic adenomas in 5.5-month-old Apce™"*Ptgs2™ (n=7) mice in which 
Ptgs2 expression is blocked, and Apc™™*Prgs2*#'’ (n= 11) littermates in which 
Ptgs2is exclusively expressed in fibroblasts (Extended Data Fig. 3i). Two-tailed 
Mann-Whitney test (duodenum), t-test (jejunum and colon), Welch’s t-test 
(ileum and total small intestine). d, Incidence of dysplasia and microadenoma 
development in the colon of Ptgs2” (n=30) and Ptgs2***" (n=24) mice treated 
with 10 weekly injections of azoxymethane (AOM). Two-sided Fisher’s exact 
test. e, HPLC-MS/MS analysis of prostanoids in the ileum of littermate Ptgs2” 
(n= 9) and Ptgs2“**' (n=7) mice. The relative intensity to the respective internal 
standard is shown. Two-tailed t-test (PGE, and PGF,,) and Mann-Whitney test 
(PGI, and PGD,). Data are mean +s.e.m. NS, non-significant; *P< 0.05; **P< 0.01; 
***P< 0.001; ****P<0.0001. 


significantly prolonged survival (Extended Data Fig. 3f, g). We observed 
no difference in tumour size in Apc™”* Ptgs2“"" mice (Extended Data 
Fig. 3h), which showed that mesenchymal Cox-2is necessary for tumour 
initiation but not for tumour growth. To understand how critical 
Cox-2 expression in the mesenchymal niche is for tumour initiation, 
compared with other cellular sources of prostanoids, we examined 
whether selective Cox-2 expression in fibroblasts is sufficient to drive 
tumour initiation in Apc-mutant mice. For this purpose, we genetically 
engineered mice in which a loxP-stop-loxP cassette was knocked into 
the Ptgs2 gene, thereby preventing its expression (Ptgs2°") unless 
excised by Cre-mediated recombination, which reactivates the gene 
(Ptgs2°") (Extended Data Fig. 3i). By crossing these with Col6-cre mice, 
we generated mice in which Ptgs2 is expressed exclusively in fibroblasts 
(Ptgs2"*'O’), We found that control Apc“ Ptgs2™ mice developed only 
a few intestinal tumours, consistently with the phenotype of Ptgs2- 
knockout mice in this model’. By contrast, Apc” Ptgs2N mice—in 
which Ptgs2 is expressed exclusively in fibroblasts—displayed robust 
tumorigenesis and developed, on average, 30 adenomas per mouse 
in the small intestine (Fig. 2c). Thus, Cox-2 in fibroblasts is both nec- 
essary and sufficient for tumour initiation in Apc-mutant mice. To 
further establish the role of Cox-2-expressing fibroblasts in controlling 
tumorigenesis, we used a model of sporadic colonic tumorigenesis, 
whichis driven by random mutations caused by repeated injections of 
azoxymethane, a potent mutagenic agent. We found that Ptgs2“"" mice 
displayed a significantly lower incidence of dysplasia and adenoma 
formation inthe colon at 28 weeks of age (Fig. 2d), along with a reduced 
number—but not reduced size—of adenomas and dysplastic foci per 
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mouse (Extended Data Fig. 3j). These results from two different mod- 
els demonstrated in vivo that fibroblasts utilize the Cox-2 pathway to 
provide atumour initiation-conductive microenvironment for mutated 
stem cells in the intestine. Thus, we show that resident fibroblasts 
physiologically control tumorigenesis in trans. 

To identify which Cox-2-mediated prostanoids are secreted by 
fibroblasts in the crypt microenvironment in vivo and drive tumour 
initiation, we performed lipidomic analyses by liquid chromatogra- 
phy-tandem mass spectrometry (HPLC-MS/MS) in the intestine of 
Ptgs2*"" mice. We identified a significant reduction in the relative abun- 
dance of PGE, and prostaglandin I2 (PGI,) in the whole tissue (Fig. 2e), 
consistent with the expression of the respective synthases, Ptges and 
Ptgis, in fibroblasts (Extended Data Fig. 1g), and atrend for decreased 
prostaglandin D2 (PGD,) and prostaglandin F,, (PGF,,.) abundance. 
Out of these prostaglandins, PGE, promotes sporadic tumour for- 
mation” but its cellular source, its cellular target and the receptor 
through which it acts have remained unknown. These are important 
considerations for therapeutic targeting in light of the adverse effects 
of Cox-2 inhibition’. Since the estimated half-life of PGE, in vivo is less 
than 15 s (ref. *), we hypothesized that PGE, secreted by RPPFs may 
diffuse through the thin (less than 1 pm) basal lamina matrix and reach 
the neighbouring mutant stem cells at a concentration sufficient to 
signal and drive tumour initiation. 

To study the effect of PGE, on intestinal stem-cell function we 
cultured crypts in organoid growth media (OGM) supplemented 
daily with 16,16-dimethyl PGE, (dmPGE,), a derivative of PGE, witha 
prolonged half-life. Notably, PGE, prevented the formation of bud- 
ding organoids, leading instead to the development of spheroid-like 
structures which lack the typical crypt-villus architecture (Extended 
Data Fig. 4a). This morphology is associated with poor differentiation 
and increased stemness™. We functionally assessed stem-cell activity 
and found that PGE,-driven spheroids contain more stem cells with 
full organoid-forming capacity (Extended Data Fig. 4b). PGE, signals 
through four receptors, EP1-EP4 (encoded by Ptgerl—Ptger4, respec- 
tively), all of which are expressed in the mouse intestine (Extended 
Data Fig. 5a). We found that EP4 is the major PGE, receptor expressed 
inthe mouse intestinal epithelium, in stem and progenitor cells and in 
the normal human colon (Extended Data Fig. 5b-d). We then generated 
intestinal epithelial cell-specific Villin-cre-Ptger4” mice (Ptger4“"°) and 
found that unlike control crypts, crypts from these mice were able to 
form budding organoids even when dmPGE, was added to the OGM 
(Fig. 3a). On the basis of these results, we focused on the specific role 
of epithelial Ptger4 in tumour initiation. 

To model the mesenchymal niche of intestinal stem cells, we devel- 
oped a3D organotypic co-culture comprised of primary intestinal 
fibroblasts and fresh crypts growing in OGM. When co-cultured with 
fibroblasts, the majority of crypts developed into spheroids rather 
than organoids (Fig. 3b). Addition of Ptger4 inhibitors or co-culture 
with Prger4“"“ crypts was sufficient to restore the growth of budding 
organoids (Fig. 3c, Extended Data Fig. 4c). Thus, organotypic cultures 
show that fibroblasts control stem-cell function via paracrine PGE,-— 
Ptger4 signalling. 

To understand the specific effects of fibroblast-derived PGE, on 
stem-cell function and differentiation we performed single-cell RNA- 
seqincrypt-fibroblast co-cultures in which Ptger4 signalling was either 
active (Ptger4-ON) or inhibited (Ptger4-OFF) (Fig. 3d). We analysed 
2,192 cells out of which 1,585 were epithelial (Extended Data Fig. 4d). By 
clustering and aligning these cells with signatures of known intestinal 
epithelial populations® (Extended Data Fig. 4e), we observed a mark- 
edly different cellular composition between Ptger4-ON and Ptger4-OFF 
co-cultures (Fig. 3d, e). First, fibroblast-derived PGE,-Ptger4 signalling 
prevented the terminal differentiation of enterocytes but not that of 
goblet or Paneth cells (Fig. 3e, f). Second, it induced a substantial 
expansion of a non-cycling population displaying an intermediate 
transcriptional profile between stem and tuft cells (cluster 3) (Fig. 3f-h, 
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Fig. 3 | Fibroblast-derived PGE, drives the expansion of RSCs witha 
Yap-driven pro-tumorigenic program via receptor Ptger4. a, Indicative 3D 
cultures of crypts isolated from Ptger4” and Ptger4“"« mice grown with OGM or 
OGM supplemented with 0.11.M dmPGE,. Scale bar, 100 pm. b, Normal crypts 
were cultured as 3D organoids in OGM (n=2) or inco-cultures with primary 
mouse intestinal fibroblasts (n=3). The absolute numbers of organoids or 
spheroids per 3D culture are shown. Results are representative of five 
independent experiments. Scale bar, 200 pm. c, Crypts isolated from Ptger4” 
and Ptger4“"¢ mice were grown in co-cultures with wild-type (WT) primary 
mouse intestinal fibroblasts (n=3 per genotype). The percentage of organoids 
and spheroids grown per 3D culture is shown. Results are representative of two 


Extended Data Fig. 4f). These cells express specific marker genes such as 
Ly6a (which encodes Sca-1), Clu, Msin and /lirn (Extended Data Fig. 6b). 
Cluis a marker of revival stem cells’, a quiescent population that func- 
tions as reserve stem cells (RSCs)™ and is induced upon irradiation 
damage in the intestinal epithelium. We found that the overall transcrip- 
tional program of cluster 3 cells is highly similar to that of RSCs (Fig. 3i, 
Extended Data Fig. 4g). Furthermore, Ptger4 is expressed in RSCs and 
is strongly induced following irradiation damage in the regenerative 
intestinal epithelium (Extended Data Fig. Se). 

To understand the molecular link between PGE,-Ptger4 and the 
RSC phenotype, we first tested whether PGE, activates the B-catenin 
pathway as reported in wound-associated epithelial cells”. However, 
we found a strong downregulation of the B-catenin transcriptional 
program in RSCs compared with cycling stem cells and no overall differ- 
ence in this pathway between the Ptger4-ON and Ptger4-OFF conditions 
(Fig. 3j, Extended Data Fig. 4h, k). By contrast, we observed that Ptger4- 
ON spheroids overexpressed a set of genes reported to be targets of 
Yap" as well as overall overexpression ofa Yap transcriptional program’® 
(Extended Data Figs. 4k, 6a, b). This effect was validated in independ- 
ent PGE,-driven spheroid cultures and confirmed genetically to be 
mediated by Ptger4 (Extended Data Fig. 6c-e). Yap is a transcriptional 
effector of Hippo signalling, whichis involved in stemness, organ size 
control, tissue homeostasis, regeneration and cancer” and is key for 
RSC function”. Yap is also indispensable for tumorigenesis driven by 
Apc-deficient stem cells'*”°. Indeed, we found that a signature of early 
Apc™"”* tumorigenesis correlated with the Yap program (Extended Data 
Fig. 6f) and both were predominantly expressed in RSCs (Fig. 3k, |, 
Extended Data Fig. 4i, j). 

To directly examine whether Yap mediates fibroblast-driven expan- 
sion of RSCs, we isolated crypts from intestinal epithelial cell-specific 


independent experiments. Scale bar, 100 pm. Inb,c, dataare mean+s.e.m.; 
two-way ANOVA, ****P< 0.0001. d-I, Single-cell RNA-seq of intestinal crypt- 
fibroblast co-cultures grown in OGM with 10 uM Ptger4 inhibitor (Ptger4-OFF) 
or DMSO (Ptger4-ON). Analyses are shown for 1,585 epithelial cells. d, SNE 
plot indicating epithelial cells from co-cultures with Ptger4-ON or Ptger4-OFF. 
e, SNE plot with clustering results. f, Proportion of each epithelial cluster 
among total epithelial cells in co-cultures with Ptger4-ON or Ptger4-OFF. 

g-l, Violin plots showing the entire range of metagene expression levels per 
single cell per cluster for the signatures/transcriptional programs of stem 
cells! (g), tuft cells’ (h), RSCs" (i), B-catenin (j), Yap (k) and early (non-tumour) 
Apc™"”* tumorigenesis (I). 


Villin-cre-Yapl™ mice (Yap1*"*) and co-cultured them with fibroblasts 
in OGM. Although Yap1‘"¢ crypts require epiregulin supplementa- 
tion to grow’, fibroblasts supported their growth in the absence of 
exogenous epiregulin (Extended Data Fig. 7a). Notably, unlike Yap?” 
crypts, Yap1“"“ crypts did not form spheroids in these co-cultures, 
but instead retained their crypt morphology (Extended Data Fig. 7a). 
Flow cytometry analyses for the RSC marker Sca-1 showed a robust 
expansion of Sca-1’ cells in co-cultures of fibroblasts with Yap#” crypts 
which was prevented in co-cultures with Yap1“"“ crypts (Extended Data 
Fig. 7b). Collectively, these analyses revealed that fibroblast-derived 
PGE, drives the expansion of an RSC population with a regenerative/ 
tumorigenic program via Ptger4 and Yap. 

Next, we examined how PGE, activates a Yap transcriptional program 
incrypts. In contrast to an earlier study using a cancer cell line”, stimula- 
tion of intestinal organoids with PGE, did not induce Yap/ expression at 
the RNA or at the protein level (Extended Data Fig. 6g, h). In addition, 
day 3 PGE,-driven spheroids showed no difference in total Yap protein 
expression (Extended Data Fig. 6g). Since G-protein-coupled receptor 
(GPCR) signalling has been suggested to either activate or inhibit the 
Hippo-Yap signalling pathway”, and Ptger4 is a GPCR, we hypothesized 
that PGE, may activate Yap in the intestinal epithelium by inhibiting its 
regulation by the Hippo pathway. Indeed, when we stimulated wild- 
type organoids with dmPGE,, we observed Yap dephosphorylation at 
Ser127 within 30-60 min (Extended Data Fig. 6i), suggestive of inhibi- 
tion of Hippo activity and activation of Yap. This effect was mediated 
by Ptger4 (Fig. 4a). Furthermore, stimulation of wild-type organoids 
with dmPGE, led to nuclear translocation of Yap within 30-60 min 
(Fig. 4b) and transcriptional activation of Yap target genes (Extended 
Data Fig. 6h), which was prevented by a Ptger4 inhibitor (Fig. 4c). PGE,- 
stimulation experiments with Yap1“" organoids or wild-type organoids 
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Fig. 4| Epithelial Ptger4 induces Yap nuclear translocation, mediates RSC 
mobilization and drives tumour initiation. a, b, Wild-type organoids 
pretreated with or without 10 pM Ptger4 inhibitor were stimulated with 0.144.M 
dmPGE,.Western blots for phosphorylated Yap Ser127 (pYap) and total Yap in 
cytoplasmic lysates (a), Yap and TBP in nuclear lysates (b). Results are 
representative of two experiments. c, d, Relative expression of Yap target 
genes'®. c, Wild-type organoids treated with 10 pM Ptger4 inhibitor and 0.1uM 
dmPGE, for13h.n=3to4 cultures per condition. One-way ANOVA. d, Yap1*"* 
and Yap?” organoids treated with 0.1 1M dmPGE, for 13 h. Three cultures per 
genotype per condition. Two-tailed t-test. e, f, Ptger4” (n= 3) and Ptger44"*¢ 
(n=3) mice received 14 Gy of abdominal irradiation. On day 3, the ileum was 
analysed by haematoxylin and eosin staining (e) and immunostaining for 
lysozyme (f). Results are representative of three independent experiments. 
Scale bars, 50 pm. g, Immunostaining for Yap in the small intestine of five-week- 


treated with verteporfin (an inhibitor of the Yap-Tead interaction”’) 
demonstrated that the activation of these genes is mediated by 
Yap (Fig. 4d, Extended Data Fig. 6j). These results establish that 
PGE,-Ptger4 signalling inhibits Hippo activity and, consequently, leads 
to Yap nuclear translocation and induction of a Yap-Tead-dependent 
gene-expression program in the intestinal crypt. 

Epithelial ablation of Ptger4 in Ptger4“" mice is efficient but does 
not affect stem-cell function and epithelial lineage differentiation inthe 
steady-state intestine, as assessed by single-cell RNA-seq, 5-bromo-2’- 
deoxyuridine (BrdU)-incorporation experiments and immunostaining 
for population markers (Extended Data Fig. 8a-f, i). Lineage tracing of 
Ptger4-deficient Lgr5* stem cells confirmed that Ptger4 is dispensable 
for the function of normal stem cells at the steady state (Extended 
Data Fig. 8h). Similar results were obtained in Ptgs2“"" mice (Extended 
Data Fig. 3k). To assess the role of PGE,-Ptger4—Yap in stem-cell repro- 
gramming in vivo, we employed an abdominal-irradiation-induced 
injury model inthe context of which slow-cycling RSC populations are 
mobilized and mediate the epithelial regenerative response”. In the 
absence of Yap, although no phenotypeis observed at the steady state, 
the epithelial response to irradiation is perturbed, causing increased 
Paneth cell differentiation’. We found that exposure of Prger4“"* mice 
to 14 Gy of abdominal irradiation led to a pronounced expansion of 
Paneth cells three days after irradiation (Fig. 4e, f), thereby phenocopy- 
ing the effect of Yap deficiency. These results functionally validate the 
critical function of Ptger4 for RSC mobilization in vivo. 

In early tumour initiation, we found that Yap displays an increased 
nuclear localization in microadenomas of five-week-old Apc™”* mice 
(Fig. 4g). Furthermore, we observed that Sca-1, a Yap target gene 
and RSC marker, is detected in the mesenchyme but not in the epi- 
thelium in the steady-state in wild-type mice. By contrast, however, 
in the intestine of five-week-old Apc“ mice, we observed areas of 
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5.5 months old 


old wild-type and Apc“ mice. Nuclear Yap is evaluated on the basis of 
colocalization with DAPI. Results are indicative of at least eight 
microadenomas. h, Immunostaining for Sca-1in the small intestine of five- 
week-old wild-type and Apc“"* mice. Results are representative of two 
independent experiments. i, k, Quantification of areas of the small intestine 
with expansion of Sca-I' epithelial cells in Apc” Yap1” (n=3) and 
Apc™"*Yap1“¢ (n =5) mice (i), and Apce™"*Ptger4” (n=7), Apc™™*Ptger4**¢ 
(n=5) mice (k). Two-tailed ¢-test.j, |, Number of BrdU‘ microadenomas per 
small-intestinal section of Apc™”“ Yap (n= 4) and Apc™*Yap1"* (n=5) mice 
(j), and Apc" Ptger4 (n=5) and Apc™*Prger4*"« (n=7) mice (I). Two-tailed 
Mann-Whitney test (j); two-tailed t-test (I). m, Number of macroscopic 
adenomas in5.5-month-old Apc“ Ptger4” (n= 9) and Apc Ptger4“" (n=8) 
mice. Two-tailed t-test (duodenum and colon); Welch’s f-test (jejunum and total 
small intestine); Mann-Whitney test (ileum). Data are mean+s.e.m. 


the epithelium in which Sca-1* epithelial cells were detected as local 
expansions, whereas Sca-1' epithelial cells were more widespread in 
microadenomas (Fig. 4h). Similar expansion of Sca-1* cells has been 
described in regenerative contexts suchas the response to irradiation 
and to helminth infection”. Given these data, we examined the role of 
Yap inthe expansion of Sca-1' cells and in tumour initiation. In five-week- 
old Apc“ Yap1“¢ mice, we found a markedly decreased number of 
Sca-1* areas in the epithelium compared with littermate Apc” Yapl” 
controls, as well as an almost completely abrogated formation of micro- 
adenomas (Fig. 4i,j). These results show that in early tumour initiation, 
Yap translocates to the nucleus and drives the expansion of Sca-1* cells 
and the formation of microadenomas. Nuclear localization of Yap and 
epithelial Sca-1 expression were also observed in developed tumours 
in five-month-old Apce™* mice (Extended Data Fig. 9a, b). Moreover, 
mice treated with repeated azoxymethane injections displayed nuclear 
localization of Yap in tumours and overexpression of the Yap target 
gene Clu (Extended Data Fig. 9c, d). 

To address whether fibroblast-derived PGE, activates the Yap pro- 
gram and drives tumour initiation via epithelial Ptger4, we generated 
Apc“ Ptger4“" mice. Given the crucial role of Yap intumour initiation, 
we first examined whether Ptger4 mediates the activation of Yap target 
genes in these mice. We found that at five weeks of age, Apc™””* mice 
displayed an increased expression of Yap target genes, but not of Yap1 
itself, compared with normal controls; however, in Apc™™ Ptger44"¢ 
littermates, this upregulation of the same Yap targets was abrogated 
(Extended Data Fig. 9e). Most notably, five-week-old Apce™”*Ptger44"¢ 
mice displayed an attenuated expansion of Sca-I’ cells inthe epithelium 
compared with their Apc“ Ptger4” littermates (Fig. 4k). Moreover, 
Apc“ Ptger4“"“ mice developed fewer microadenomasat five weeks 
of age (Fig. 41). They also displayed a strong reduction inthe number of 
macroscopic tumours formed at 5.5 months (Fig. 4m), a significantly 


alleviated splenomegaly (Extended Data Fig. 9f) and—consistent with 
arole of Ptger4 in tumour initiation rather than tumour growth—no 
difference in the size of the tumours formed (Extended Data Fig. 9g). 
These results provide definitive genetic evidence that epithelial Ptger4 
is the receptor that mediates the tumorigenic effect of PGE, and explain 
the role of RPPFs as paracrine drivers of tumour initiation. 

Onthe basis of these results, we addressed the role of PGE,-PTGER4 
inthe human intestinal stem-cell niche. By isolating crypts from normal 
parts of the colon from three patients and culturing them in OGM we 
found that PGE, drives the formation of spheroid-like structures. This 
effect was fully prevented by treatment with a PTGER4 inhibitor, thus 
confirming that PGE,-PTGER4 also controls stem-cell function in the 
human colonic crypt (Extended Data Fig. 10a). Furthermore, we per- 
formed immunostaining for YAP in tissues from 16 patients, including 
individuals with sporadic adenomas and adenocarcinomas, familial 
adenomatous polyposis, Lynch syndrome and cancer associated with 
inflammatory bowel disease (Supplementary Table 3). We observed that 
YAP displayed a nuclear localization in tumours but not in the neigh- 
bouring normal areas of the tissue in all these samples (Extended Data 
Fig. 10b), supporting its role in tumorigenesis. Of note, both PTGER4 
and YAP genetic loci were recently identified to be genetically associ- 
ated with colorectal cancer risk in genome-wide associations studies”, 
further underlining the relevance of this pathway to human disease. 

The results of this study show that PGE,-secreting RPPFs provide a 
micro-niche favouring the activation of the pro-tumorigenic Yap pro- 
gram in neighbouring stem cells, thereby driving tumorigenesis inthe 
presence of mutations (Extended Data Fig. 10c). This work establishes 
in vivo that the formation of intestinal tumours requires the parac- 
rine interaction of mutated stem cells with their native mesenchymal 
microenvironment. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded 
to allocation during experiments and outcome assessment unless 
otherwise stated in the Methods. 


Mice 

Ptgs2 mice” crossed with Col6-cre mice”®, Rosa26™"”"" mice”? and 
Apc™"”* mice*° were bred inthe animal facilities of the BSRC ‘Alexander 
Fleming’ under specific pathogen-free conditions. Ptger4” mice® and 
Yap¥” mice” crossed with Villin-cre mice@and Apc™”* mice®°, wild-type 
mice used for organoid experiments, Lgr5-eGFP-IRES-creERT2 mice**, 
Pdgfra°*” mice’, as well as Col6-cre mice”® crossed with Rosa26@7""-/* 
mice (Ai14)® and Ptgs2 Lox-Stop-Lox-knockin mice (Ptgs2‘; the genera- 
tion of this mouse strain is described below) were bred in the facilities 
of the Yale Animal Resources Center. All these mice were maintained 
onaC57BL/6] genetic background. Fefr2-T2A-H2B-mCherry mice® were 
maintained in the facilities of the Icahn School of Medicine at Mount 
Sinai on the 129S4 genetic background. Mice were housed in stand- 
ard cages, on a 12:12 h day:night cycle and were fed a standard rodent 
chow. Mice were used for experiments at 8-12 weeks of age unless 
otherwise indicated. For all experiments, littermate, co-housed and 
sex-matched mice were used. Both male and female mice were used 
for experiments. No mice were excluded from the analyses performed. 
End points used for mice developing tumours were changes in activity 
or mobility, abnormal posture, decreased food and/or water intake 
and decreased body temperature. Experiments in BSRC ‘Alexander 
Fleming’ were approved by the Institutional Committee of Protocol 
Evaluation in conjunction with the Veterinary Service Management 
of the Hellenic Republic Prefecture of Attika according to all current 
European and national legislation. All animal experimentation at Yale 
was performed in compliance with Yale Institutional Animal Care and 
Use Committee protocols. 


Generation of Ptgs2 Lox-stop-Lox-knockin mice 

Ptgs2 flox-stop-knockin mice were generated by the University of 
California, Davis Mouse Biology Program services. JMB8 (C57BL/6N) 
embryonic stem (ES) cells were targeted with a vector containing a 
diphtheria toxin A (PGK-DTA) cassette, a4 kb 5’ arm of homology, two 
loxP sites within intron 3 of the Ptgs2 gene flanking a STOP cassette 
sequence (derived from Addgene plasmid 11584), and a frt-flanked PKG- 
neomycin cassette anda5.1kb 3’ arm of homology. The PKG-neomycin 
element enabled positive selection in ES cells, while the DTA element 
enabled negative selection in ES cells. Mice bearing the targeted lox- 
stop-frt-PGK-neomycin-frt-lox Ptgs2 allele in the germline were crossed 
with the B6N(B6J)-Tg(CAG-Flpo)1Afst/Mmucd transgenic mouse 
(Mutant Mouse Resource and Research Center MMRC_036512-UCD) 
and the PGK-neomycin cassette was removed and mice bearing a lox- 
stop-frt-lox Ptgs2 allele (Ptgs2'*') were obtained (Extended 
Data Fig. 3i). 


Human study participants 

Fresh human colon tissue was obtained from the Yale Pathology 
Archives onthe basis of Yale Human Investigation Committee protocols 
no. 0304025173, which allows retrieval of tissue from surgical pathol- 
ogy that was consented or has been approved for use with waiver of 
consent. The data were analysed anonymously from preexisting patient 
databases and are thus exempt from consent by the human studies 
committee. Patient characteristics (sex, age, diagnosis) are described 
inthe Supplementary Table 3. All tissue segments were obtained from 
the uninvolved surgical margins of colon resections. The specific part of 
the colon resected is indicated inthe Supplementary Table 3. Ischaemic 
time of all samples ranged from 1h to 3 h. All collected samples were 
kept on ice-cold RPMI medium before processing. 


Formalin-fixed paraffin-embedded colorectal tumour tissue was 
obtained from the Yale Pathology Archives. The data were analysed 
anonymously from preexisting patient databases and hence exempt 
from consent by the human studies committee. Patient characteristics 
(sex, age, diagnosis) are described in the Supplementary Table 3. 


Isolation of human intestinal epithelial cells and stromal cells 
For the isolation of intestinal epithelial and stromal cells the tissue was 
cut into 0.5 cm pieces and incubated five times in HBSS containing 
0.5 mM EDTA and 1mM DTT for 15 min, at 4 °C ona rocker. Epithe- 
lial cells were released by vigorous shaking and passed through a 
70-um strainer, washed and used for RNA isolation. For stromal cell 
isolation the tissue pieces were incubated in DMEM containing 10% 
FBS, 300 U mI’ Collagenase XI (Sigma, C7657), 0.1 mg mI” Dispase Il 
(Sigma, D4693) and 50 U mI DNase Il Type V (Sigma, D8764) for 1h, 
at 37 °C, 200 rpm. Cells released after vigorous shaking were passed 
through a 70-um strainer, treated with ammonium-chloride-potassium 
red-blood-cell-lysing buffer, washed with 2% sorbitol and then used 
for RNA isolation. 


Human colonic organoid culture 

For the isolation of human colonic crypts the tissue was cut into 
0.5-cm pieces and incubated six times in PBS containing 5 mM EDTA 
and1mMDTT for 10 min, at 4 °C onarocker. Epithelial cells and whole 
crypts were released by vigorous shaking. The fractions enriched for 
crypts were further processed. Crypts were washed by centrifuga- 
tion at 100g, 50g and 30g and then used for organoid development in 
domes made by Matrigel (Corning, 356231) and IntestiCult Organoid 
Growth Medium (Human) (Stem Cell Technologies, 06010) accord- 
ing tothe manufacturer’s guidelines. When indicated, 16,16-dimethyl 
PGE, (Cayman, 14750) dissolved in ethanol was added daily at a final 
concentration of 0.1 pM. Ethanol was used as a vehicle control for 
the untreated organoids. The ONO-AE3-208 Ptger4 (EP4) inhibitor 
(Cayman, 14522) dissolved in DMSO was added ata final concentration 
of 10 uM Lhbefore stimulation and DMSO was usedas a vehicle control. 


Isolation of mouse intestinal epithelial cells and mesenchymal 
cells 

The intestine was dissected, flushed, opened longitudinally and then 
cut into 1cm pieces. The tissues were incubated in HBSS containing 
1mM EDTA, I mM DTT, 0.2% FBS, 4-5 times, 10 min each, at 37 °C, 
200 rpm. Epithelial cells were released by vigorous shaking, passed 
through a 70 pum strainer, washed and immediately lysed for RNA 
isolation. After epithelial cell removal, the remaining stromal part 
of the intestine was lysed for RNA isolation. For Drop-seq analysis or 
for FACS-sorting of mesenchymal cells the tissues were processed as 
above and then incubated in DMEM 10% FBS containing Collagenase 
XI (300 units/ml, Sigma, C7657), Dispase II (0.1 mg/ml, Sigma, D4693) 
and DNase II Type V (50 units/ml, Sigma, D8764) for 1h, at 37 °C, 
200 rpm. Cells released after vigorous shaking were passed through 
a 70 pm strainer and washed with 2% sorbitol. Such cell preparations 
were directly processed by Drop-seq or by flowcytometry as described 
below. 


Mouse intestinal organoid culture and fibroblast/crypt 
organotypic co-culture 

Crypts were isolated from the last three fourths of the small intestine. 
The intestine was flushed, cut longitudinally and the villi were scraped 
off with a glass coverslip. The tissue was then cut into 0.5 cm pieces 
which were incubated in PBS containing 5 mM EDTA, 0.2% FBS for 
30 min at 4 °C ona rocker. Crypts were released by vigorous shaking 
and were passed througha 70 um strainer. Six fractions were obtained 
after vigorous shaking and the ones enriched for crypts were further 
processed. Crypts were washed by centrifugation at 200g, 100g and 50g 
and then used for organoid development in domes made by Matrigel 


(Corning, 356231) and IntestiCult Organoid Growth Medium (Stem Cell 
Technologies, 06005) according to manufacturer’s guidelines. When 
indicated, dmPGE, (Cayman, 14750) dissolved in ethanol was added 
daily at a final concentration of 0.1 1M. Ethanol was used as a vehicle 
control for the untreated organoids. Crypts isolated from Yap1*"“ mice 
were cultured in IntestiCult Organoid Growth Medium (Stem Cell Tech- 
nologies, 06005) supplemented with 0.5 pg ml recombinant mouse 
epiregulin (RnD 1068-EP-050) or co-cultured with intestinal fibroblasts 
in OGM without epiregulin. 

For the assessment of stem-cell activity, organoids or spheroids were 
dissociated into single cells by incubation at 37 °C in 0.25% trypsin-EDTA 
solution (Gibco, 25200056) diluted 1:1 with DMEM without serum. 
Numbers of live cells were counted after staining with trypan blue. 
In each experiment, the same number of live single cells per condi- 
tion (n = 3,000-11,000) were cultured in domes made by Matrigel 
(Corning, 356231) and OGM. 

Intestinal organoids were stimulated with dmPGE, at a final 
concentration of 0.1 pM. Ethanol was used as a vehicle control. The 
ONO-AE3-208 Ptger4 (EP4) inhibitor (Cayman, 14522) dissolved in 
DMSO was added ata final concentration of 10 uM 1h before stimula- 
tion. Verteporfin (Cayman, 17334) dissolved in DMSO was added at a 
final concentration of 1 1M 1h before stimulation. DMSO was used as 
a vehicle control for ONO-AE3-208 and Verteporfin. 

Fibroblasts were isolated from the small intestine of mice. The intes- 
tine was dissected, flushed, opened longitudinally and then cut into 
1-cm pieces. The tissues were incubated in HBSS containing 1 mM EDTA, 
1mM DTT and 0.2% FBS 4-5 times for 10 min each, at 37 °C, 200 rpm. 
Epithelial cells were released by vigorous shaking. Then, the tissues 
were incubated in DMEM 10% FBS containing Collagenase XI (300 Um", 
Sigma, C7657), Dispase II (0.1 mg ml“, Sigma, D4693) and DNase ll Type 
V(50UmI", Sigma, D8764) for 1h, at37 °C, 200 rpm. Cells released after 
vigorous shaking were passed through a 70-um strainer, washed and 
cultured in DMEM with 10% FBS. For co-culture experiments 2 x 10* 
fibroblasts were seeded in 48-well plates overnight. Freshly isolated 
crypts (n=500) were suspended in 1:1 Matrigel (Corning, 356231) and 
OGM and addedas an overlay onthe fibroblasts. Crypts and fibroblasts 
were co-cultured with OGM. When indicated, the ONO-AE3-208 Ptger4 
(EP4) inhibitor dissolved in DMSO was added to the co-cultures every 
second day ata final concentration of 10 1M. DMSO was usedasa vehicle 
control for the untreated co-cultures. 


Quantitative real-time PCR 

RNA was isolated with the TRIzol reagent (Thermo Fisher, 15596026) 
followed by DNase! treatment (Roche, 04716728001) or with the QIA- 
GEN RNA isolation RNeasy plus Mini Kit (QIAGEN, 74134) according 
to the manufacturer’s instructions. Reverse transcription was per- 
formed with the Maxima H Minus Reverse Transcriptase (Thermo 
Fisher, EPO751). RT-qPCR analyses were performed using iTaq 
Fast SYBR Green Supermix (Bio-Rad, 1725100) and a CFX96 Touch 
Real-Time PCR Detection System (Bio-Rad). Data were acquired and 
analysed with the CFX Manager software (Bio-Rad). Gene expression 
relative to acontrol sample was calculated with the RelQuant software 
(Bio-Rad Laboratories) by normalizing to B2m expression. Where 
indicated, relative expression (RE) to 82m was calculated as RE=24“. 
Primers used for human were B2M-F: ATGAGTATGCCTGCCGTG 
TG, B2M-R: CCAAATGCGGCATCTTCAAAC, PTGS2-F: TGT TGAAAAGTAG 
TTCTGGG, PTGS2-R: AAGCAGGCTAATACTGATAGG. Primers for mouse 
wereB2m-F:TTCTGGTGCT TGTCTCACTGA, B2m-R: CAGTATGT TCGGCTT 
CCCATTC, Ptgs2: QT00165347 (QIAGEN) and Ptgs2-F: TCCAACCTCT 
CCTACTACACCAG, Ptgs2-R: GGGTCAGGGATGAACTCTCTC, Ptger1- 
F: AAGTTTTGGATTCACTTCCC, Ptger1-R: GAAGGTGTTGAGATTC 
TTGG, Ptger2-F: CTTGCCTTTCACAATCTTTIG, Ptger2-R: ACCCA 
AGGGTCAATTATAGAG, Ptger3-F: CGCCGCTATTGATAATGATG, 
Ptger3-R: TTCT TAGCAGCAGATAAACC, Ptger4-F: GTGCGGAGATCCA 
GATGGITC, Ptger4-R: TCACCACGT TTGGCTGATATAAC, Ly6a-F: GAAAG 


AGCTCAGGGACTGGAGTGTT, Ly6a-R: TTAGGAGGGCAGATGGGTAA 
GCAA, Clu-F: GCTGCTGATCTGGGACAATG, Clu-R: ACCTACTCCCTTGAG 
TGGACA, /lIrn-F: GCTCATTGCTGGGTACTTACAA, /lIrn-R: CCAGACTTGG 
CACAAGACAGG, Cxcl16-F: CCTTGTCTCTTGCGTTCTTCC, Cxcli6-R: 
TCCAAAGTACCCTGCGGTATC, Msin-F: CTTAGTCT TGGGTGGATA, Msin- 
R: TCTTCTGTCTTACAGCCA, Yap1-F: GATGTCTCAGGAAT TGAGAAC 
and Yap1-R: CTGTATCCATTTCATCCACAC. 


Western blot 

Total protein was extracted with RIPA lysis buffer. Nuclear and 
cytoplasmic fractions were extracted with NE-PER Nuclear and 
Cytoplasmic Extraction Reagents (Thermo Fisher, 78833). Protease 
inhibitors (Thermo Fisher, 87786) and phosphatase inhibitors (Thermo 
Fisher, 78420) were added. Antibodies against pYAP (Ser127) (D9W2I) 
(Cell Signaling, 13008), YAP (D8H1X) (Cell Signaling, 14074) and TBP 
(D5C9H) (Cell Signaling, 44059) were used at a1:1,000 dilution in5% BSA, 
overnight. Antibodies against B-actin (clone C4, Santa Cruz sc-47778) 
were used at a 1:2,000 dilution in 5% BSA for 2h at room temperature. 


Immunofluorescence and imaging 

Two-photon microscopy was performed with a LaVision TriM Scope 
Il (LaVision Biotec) microscope equipped with a Chameleon Vision 
Il (Coherent) two-photon laser in the In Vivo Imaging Facility of Yale 
School of Medicine. Fresh whole-mount specimens of the small intes- 
tine and the colon of a Pdgfra“" mouse‘ were prepared and analysed 
immediately after mice were euthanized. 

Confocal imaging was performed with a Nikon-Ti microscope 
combined with UltraVox spinning disk (PerkinElmer) and data were 
analysed by using the Volocity software (PerkinElmer). Rosa26-tdTo- 
mato, Pdgfra-eGFP and Fgfr2-mCherry were detected by direct fluo- 
rescence. The tissues were dissected, fixed in 4% paraformaldehyde 
for 4 hat 4 °C, followed by incubation in 30% sucrose/PBS overnight 
at 4 °C. Tissue samples were frozen in OCT (Tissue-Tek) on dry ice and 
kept at -80 °C until sectioning. Sections 10 ppm thick were prepared 
with acryostat (Leica). After washing with PBS, sections were mounted 
with Fluoroshield histology medium containing DAPI (Sigma, F6057). 
For Lgr5S—eGFP and vimentin immunostaining, the terminal ileum ofa 
Lgr5-eGFP-IRES-creERT2 mouse™, was dissected, frozen and pro- 
cessed as above. The staining was performed with an Alexa Fuor 488- 
conjugated rabbit polyclonal GFP antibody at 1:200 (Thermo Fisher, 
A-21311) and an Alexa Fuor 647-conjugated rabbit monoclonal vimen- 
tin antibody at 1:200 (Cell Signaling, 9856), overnight at 4 °C. Sca-1 
immunostaining was performed in frozen tissue sections processed as 
above and stained witha rat monoclonal Alexa Fuor 647 antibody (E13- 
161.7, Biolegend 122517, 1:400) overnight at 4 °C. The number of Sca-1" 
areas was quantified in sections of small intestine prepared with the 
Swiss-roll technique and an EVOS FL Auto 2 Imaging System (Thermo). 
The evaluation was blinded to mouse genotype. 

Formalin-fixed paraffin-embedded tissue sections were depar- 
affinised, washed and antigen retrieval was performed by microwave 
heating in citrate buffer. For Cox-2 immunostaining an anti-COX-2 
rabbit polyclonal primary antibody was used (Cayman, 160126) at a 
1:150 dilution, overnight at 4 °C with an anti-rabbit Alexa Fuor 488 
secondary antibody at a1:1,000 dilution, for 2h at room temperature. 
Immunostaining for epithelial lineage markers was performed with 
conjugated antibodies against lysozyme (FITC-conjugated rabbit 
polyclonal, DAKO EC 3.2.1.17, 1:100) and Dclk1 (Alexa Fuor 647 rabbit 
monoclonal, EPR6085, Abcam ab202755, 1:400) and primary antibodies 
against chromogranin-A (rabbit polyclonal, Abcam ab15160, 1:300) and 
Olfm4 (rabbit monoclonal, D6YSA, Cell Signaling 39141, 1:300) followed 
by an anti-rabbit Alexa Fuor 488 secondary antibody as above. Immu- 
nostaining for Yap was performed with a rabbit monoclonal primary 
antibody (D8H1X, Cell Signaling 14074, 1:50, overnight at 4 °C), a goat 
anti-rabbit biotinylated IgG secondary antibody (Vector, 1:750) and 
Streptavidin—Alexa Fuor 488 (1:800). Colocalization of Yap-Alexa Fuor 
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488 and DAPI was detected with the colocalization mode of Volocity 
software (PerkinElmer). Immunostaining for laminin Al was performed 
with arat monoclonal antibody (AL-4, R&D MAB4656, 1:50) overnight 
at 4 °C and an anti-rat Alexa Fuor 594 secondary antibody at a1:1,500 
dilution, for 1h at room temperature. The numbers of positive cells 
for each epithelial marker per well-oriented crypt or crypt-villus unit 
were quantified in a blinded fashion. 


BrdU 

Administration of BrdU (Sigma) was performed intraperitoneally 
at a dose of 100 pg per g of body weight 2h before mice were euthanized. 
BrdU immunohistochemistry was performed in sections of formalin- 
fixed paraffin-embedded tissues with the BrdU In-Situ Detection Kit 
(BD Pharmingen). The sections were counterstained with haematoxylin 
and analysed with a Leica DMI6000B microscope equipped with the 
Leica Application Suite LAS v.2.7 software. The number of BrdU" cells 
per well-oriented crypt was quantified in a blinded fashion. 


Alkaline phosphatase 

Alkaline phosphatase activity was detected in deparaffinized 
sections with the Vector Red Alkaline Phosphatase Substrate Kit (Vector, 
SK-5100) in 200 mM Tris-HCl, pH 8.5 according to the manufacturer's 
instructions. The sections were counterstained with haematoxylin and 
mounted with DPX or with Cytoseal Xyl (Thermo). 


Insitu hybridization 

Insitu hybridization was performed using the C Multiplex Fluorescent 
Detection Kit v.2 (ACD Bio) according to the manufacturer’s instruc- 
tions. The colons of eight-week-old wild-type mice were excised, rolled 
up and immediately frozen in liquid nitrogen before embedding in 
Tissue-Tek OCT compound (Sakura Finetek). Sections with a thickness 
of 15 um were prepared for RNAscope analysis using a mouse Rspol 
probe (ACD Bio 401991), amouse Ppib positive control (ACD Bio 313911) 
and the bacterial DapB probe as a negative control (ACD Bio 310043). 
DAPI was used as a nuclear counterstain. 


Evaluation of tumorigenesis in Apc™”* mice 

Early tumorigenesis in Apc“”* mice was examined at the age of five 
weeks. The entire small intestine was collected and fixed in 10% neutral- 
buffered formalin solution as a Swiss roll. H&E-stained paraffin sec- 
tions were examined with a Nikon Eclipse E800 microscope or a Leica 
DMI6000B microscope. The number of microadenomas was quantified 
in sections stained for B-catenin (Fig. 2) or BrdU (Fig. 4). At the age of 
5.5 months, tumorigenesis in Apc“””* mice was evaluated in the small 
intestine and the colon. The small intestine was partitioned into three 
parts of equal length (duodenum, jejunum and ileum). The tissues were 
opened longitudinally and the number of macroscopic tumours was 
quantified. The opened small intestine was rolled, fixed in formalin and 
H&E-stained paraffin sections were obtained. Pictures of all adenomas 
detected per section were obtained and their maximal diameter was 
measured by using the Image) software or the Leica Application Suite 
LAS v.2.7. All analyses were blinded to mouse genotype. 


Azoxymethane-induced colon tumorigenesis 

Mice were injected intraperitoneally with 10 mg kg“ of azoxymethane 
(Sigma, A5486) once per week for 10 weeks starting at the age of 6 
weeks. Mice were euthanized at the age of 28 weeks. Dysplasia develop- 
ment and adenoma formation were evaluated in H&E-stained paraffin 
sections of the colon. RNA was extracted from formalin-fixed, paraffin- 
embedded tissues with the RecoverAll Total Nucleic Acid Isolation Kit 
(Thermo Fisher, AM1975). 


Prostanoid analysis by H#PLC-MS/MS 
Prostanoids were extracted with acetone followed by liquid/ 
liquid extraction as previously described*°, with some modifications. 


In brief, 1O-SO mg of the ileum was homogenized in 500 pI PBS spiked 
with 100 pM butylated hydroxytoluene on ice. PGE,-d4 (Cayman, 
no. 314010) and PGD,-d4 (Cayman, no. 312010) were used as internal 
controls in each sample from the beginning of the extraction procedure 
at a final concentration of 10 ng mI. The samples were deprotein- 
ized with acetone. After mixing for 4 min and centrifugation at 2,000g 
for 10 min at 4 °C, the samples were transferred to clean 15-ml glass 
tubes, mixed with 800 pl hexane for 30 s and centrifuged for 10 min 
at 2,000g at 4 °C. The lower phase was acidified to pH 3.5 with formic 
acid and then mixed with chloroform. After mixing for 30 s and cen- 
trifugation for 10 min at 2,000g at 4 °C, the lower chloroform phase 
was evaporated to dryness under a stream of nitrogen and redissolved 
in 50 pl of methanol. HPLC-MS/MS analysis was performed using a 
modification of a method previously described”. From each sample 
avolume of 5 pl was injected into a Gemini 5 pm C18 110 A, 100 x 2mm 
HPLC column (Phenomenex, 0OD-4435-BO) coupled with an Agilent 
6490 QQQ Triple Quadrupole mass spectrometer with electrospray 
ionization in negative mode (Yale West Campus Analytical Core). The 
flow rate was 0.2 ml min“ and the column was maintained at ambient 
temperature. The analysis was performed using an acetonitrile-based 
gradient system mixing two solvents: solvent A was acetonitrile/water/ 
glacial acetic acid, 45/55/0.02 (v/v/v); solvent B was acetonitrile/water/ 
glacial acetic acid, 90/10/0.02 (v/v/v). The analytes were separated 
using the following gradient: 0.0-8.0 min, 0% solvent B; 8.0-8.1 min, 
0 to 50% solvent B; 8.0-12.0 min 50% solvent B; 12.0-12.1 min, 50 to 
70% solvent B; 12.1-20.0 min 70% solvent B; 20.0-20.1 min, 70 to 0% 
solvent B; 20.1-30.0 min 0% solvent B. The capillary voltage was set 
at 3,500 V, source temperature at 120 °C, desolvation temperature 
at 360 °C and cone voltage at 35 V. The detection of prostanoids was 
based on the multiple reaction monitoring (MRM) method. The transi- 
tion of precursor masses to specific fragments was monitored using a 
collision energy of 25-30 eV. PGE, and PGD, which havea similar MRM 
mass transition (m/z 351 > 271) and PGE,-d4/PGD,-d4 which also have 
similar MRM mass transition (m/z 355 > 275) were distinguished onthe 
basis of their different elution time from the HPLC column. The MRM 
mass transition for PGF, was m/z 353 > 193 and for PGI, was m/z 351 
> 215. The data were analysed with the Agilent MassHunter Worksta- 
tion software, v.B.07.00. For each mass transition the area under the 
curve was normalized with that of the corresponding internal labelled 
control and a relative abundance was calculated. The relative abun- 
dances calculated were normalized based on the weight of the tissue 
sample. PGD,-d4 was used as a control for PGD, and its metabolites. 
PGE,-d4 was used as a control for the rest of the prostanoids. 


Single-cell RNA sequencing and data analysis 
Single-cell RNA-seq was performed with the Drop-seq protocol as 
described previously**”” with minor modifications. Drop-seq analysis of 
mesenchymal/lamina propria cells isolated from the middle and distal 
colon of wild type mice was performed in two biological replicates. 
For each biological replicate, the colons of n=2 mice were pooled. The 
vast majority of intestinal epithelial cells were depleted by EDTA treat- 
mentas described above. N=5 Drop-seq collections were processedin 
total, two fromthe first biological replicate and three from the second 
(Extended Data Fig. le). Drop-seq analysis of Ptger4-OFF and Ptger4-ON 
crypt/fibroblast co-cultures was performed on day 4 of the protocol 
in one pool of six Ptger4-ON co-cultures and one pool of six Ptger4- 
OFF co-cultures with one Drop-seq collection per pool. For Drop-seq 
analysis of crypts, epithelial cells isolated from the small intestine of 
Ptger4 and Ptger4“"“ mice, tissues from n=2 mice per genotype were 
independently processed as biological replicates. A total of three Drop- 
seq collections were processed for each genotype, two from the first 
biological replicate and one from the second (Extended Data Fig. 8c). 
The cells were diluted to a concentration of 100 cells per pl and 
1-ml aliquots were used as input for each collection of the Drop- 
seq protocol***’. The beads were purchased from ChemGenes 


(no. Macosko201110) and the polydimethylsiloxane co-flow microflu- 
idic droplet generation device was generated by Nanoshift. Samples 
were processed for cDNA amplification within ~15 min of collection. 
Populations of 5,000 beads (~150 cells) were separately amplified for 
15 cycles of PCR and pairs of PCR products were co-purified by the addi- 
tion of 0.6x AMPure XP beads (Agencourt). Libraries were prepared and 
tagmented by Nextera XT using 1,000 pg of cDNA input, the custom 
primer PS_TSO Hybrid and Nextera XT primers N701-N705 (Illumina). 
Libraries from intestinal mesenchymal cells and crypt epithelial cells 
were sequenced on the Illumina HiSeq platform (paired end, 2 x 150 bp) 
and libraries from crypt-fibroblast co-cultures on the Illumina NextSeq 
500 platform (paired end, read 120 bp; read 2 60 bp), using a Read- 
1CustomSeqB* primer for read 1. 

Single-cell RNA-seq data were processed as described** to generate 
a digital expression matrix with transcript count data. This matrix 
was filtered retaining cells with more than 1,000 transcripts and less 
than 10% mitochondria transcripts. We then log transformed each 
entry of the matrix by computing log(TPM/100 + 1), where TPM is 
transcripts per million (meaning that the sum of all gene levels is equal 
1,000,000). After normalization, we used adaptively thresholded low 
rank approximation (ALRA)* to impute the matrix and fill in the techni- 
cal dropped-out values. Subsequently, to visualize the cell subpopu- 
lations in two dimensions, we applied principal component analysis 
followed by t-SNE“, a nonlinear dimension reduction method, to the 
log-transformed data. DBSCAN“ and graph-based clustering (Seurat, 
Satija lab) were then used to generate clusters that were overlaid onthe 
t-SNE coordinates to investigate cell subpopulations. Marker genes 
for each cluster of cells were identified using the Wilcoxon test with 
Seurat. Pathway enrichment analysis was performed by GSVA* and 
Pvalues were calculated with the moderated t-test implemented inthe 
Limma R package. For the adjusted P values the false discovery rate 
(Benjamini—Hochberg) correction method was used. 

Inthe fibroblast-crypt co-culture, single-cell RNA-seq experiment, 
epithelial cells were distinguished from fibroblasts and selected onthe 
basis of unbiased clustering and known marker genes as shown in the 
Extended Data Fig. 4d. Cell-cycle analysis was adapted from Seurat. 
First, ascore was calculated for each cell on the basis of the expression 
of G2M and S phase markers. Then, a discrete classification of cell cycle 
was assigned to each cell by comparing its G2M and S scores. Cells 
expressing neither were classified into the G1 group because they are 
less likely to be cycling. 

Ametagene score was assigned on the basis of publicly available bulk 
and single-cell RNA-seq datasets. For each of these datasets we selected 
significantly differentially expressed genes and constructed a meta- 
gene defined as weighted average of the log-transformed expression 
of these differentially expressed genes with weights equal to the log 
fold ratio of these genes in the respective dataset. More specifically, if 
we assume we have a metagene M that contains m genes: {gene,, gene,, 
..., gene,,} and each gene ihas log fold change (FC,) in the data we use 
for the signature of interest, and each geneihas an expression value of 
Xeene in a given cell in our dataset, then the score for M in this specific 


gs 
cell is calculated as: 


m 
Su = » Xgene i* log FC; 


i=l 


Each cell from our single-cell dataset was characterized by a score 
associated with each of the metagenes. The extent of differential behav- 
iour between distributions of the total cell populations of two different 
conditions (Ptger4-ON vs Ptger4-OFF cells) was assessed for each meta- 
gene using the Kolmogorov-Smirnov test. We built metagenes for stem 
cells, enterocytes, Paneth cells, goblet cells, enteroendocrine cells and 
tuft cells by using lists of population-specific genes based upon plate 
single-cell RNA-seq data from the mouse intestinal epithelium’. We 
also built the following metagenes: (1) a B-catenin program metagene 


based on bulk RNA-seq data from organoids bearing a murine stabilized 
mutant Ctnnb1 transgene and normal organoids (GSE93947), (2) A 
Yap program metagene based on bulk RNA-seq data from Yap-over- 
expressing and normal crypts isolated from doxycycline-treated and 
untreated Yap7g inducible transgenic mice respectively (GSE66567)'%, 
(3) Anearly Apc" tumorigenesis program metagene based on micro- 
array gene-expression data from the nonpolypotic sections of terminal 
ileum from Apc“ and wild-type mice (GSE49970). 

Single-cell RNA-seq data of the healthy human colonic mesenchyme? 
were obtained from GSE114374. Single-cell RNA-seq data of regenerat- 
ing intestinal crypts’® were obtained from GSE117783. These datasets 
were processed by ALRA and the subsequent steps as above. 


Abdominal irradiation of mice 

For abdominal X-ray irradiation an X-RAD 320 Biological Irradiator 
(Precision X-ray) was used (Research Irradiator Facilities, Department 
of Therapeutic Radiology,Yale School of Medicine). Mice were anaes- 
thetized and irradiated individually. 15 mm-thick lead was used for 
head, limb and tail shielding. Mice were treated at a distance of 50cm 
from the radiation source with 320 kV, 12.5 mA X-rays, using a filter 
consisting of 2.0 mm Al. The mouse abdomen was centred between a 
55 mm x 65 mm target window outlined by an adjustable collimator. 
The dose rate was measured with an ionization chamber by members 
of the Radiation Safety Division at Yale University. The abdominal 
dose rate was 235 cGy min“. Average dose to shielded areas was 3.54 
cGy min. 


Flow cytometry and sorting 

Freshly isolated stromal cells from Col6-cre-Rosa264"""" mice were 
stained with monoclonal antibodies against CD45 (Biolegend) and used 
for FACS sorting. FACS sorting was performed at the Yale Flow Cytom- 
etry Facility with a BD FACSAria II sorter equipped with FACSDiva 7 
software. Freshly isolated stromal cells from Col6-cre-Rosa26"""""Ptgs2’? 
mice were sorted on the basis of their GFP and tdTomato fluorescent 
protein expression with a BD FACSAria III sorter (BD) equipped with 
FACSDiva software at the Flow Cytometry Facility of BSRC Fleming. 
Single-cell suspensions from organoid cultures and co-cultures were 
obtained as described above, stained with monoclonal antibodies 
against Cd24 (Clone M1/69, Biolegend) and Sca-1 (Clone D7, Biolegend) 
and analysed at the Yale Flow Cytometry Facility with a BD LSRII cytom- 
eter equipped with FACSDiva software. Data analysis was performed 
with the FlowJo software. 


Statistical analysis 

Statistical analyses were performed with GraphPad Prism 7.01. 
Normality was tested with the Shapiro-Wilk Wtest. For W< 0.05, differ- 
ences in means were tested for statistical significance with two-tailed 
Mann-Whitney test or Kruskal-Wallis test. For W> 0.05, variances 
were compared by F test and if similar (F test, P> 0.05), unpaired 
two-tailed Student’s t-test or one-way ANOVA was applied, otherwise 
(F test, P< 0.05) unpaired two-tailed Welch’s t-test was applied. For 
paired comparisons, statistical significance was tested with paired 
t-test for W> 0.05 or with Wilcoxon matched-pairs signed-rank test 
for W<0.05. Pvalues <0.05 were considered as statistically significant. 
Survival curves were compared by the log-rank test using GraphPad 
Prism 7.01. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All data that support the findings of this study are available within 
the paper and its Supplementary Information files. All Drop-seq data 
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that support the findings of this study have been deposited in the 
Gene Expression Omnibus (GEO) repository with the accession code 
GSE142431. 


Code availability 


The code used for single-cell RNA-seq data analysis is available in GitHub 
(https://github.com/KlugerLab/Scripts Roulis_et_al_2020). 
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Extended Data Fig. 1| Mesenchymal Ptgs2 expression inthe 
microenvironment of crypt stem cells. a, Transmission electron microscopy 
photograph of the base of a mouse ileum crypt. P, Paneth cell; S, columnar basal 
stem cell; F, fibroblast. Scale bar, 5 ym. Indicative of independent observations 
in two experiments. b, Immunostaining for LgrS—eGFP and Vimentin inthe 
ileum of anLgr5-eGFP-IRES-creERT2 mouse. Scale bar, 20 pm. Indicative of 
independent observations in one experiment. c, PTGS2 relative gene 
expression (RE) inintestinal epithelial cells (IECs) and stromal cells isolated 
from healthy human colonic tissues (n= 6 individuals). Statistical comparison 
performed with two-tailed Wilcoxon matched-pairs signed-rank test. d, Ptgs2 
gene expression in IECs and stromal cells isolated from the ileum and the colon 
of wild-type mice (n= 4). Statistical significance was determined by two-tailed 
paired f-test. e, Biological replicates of the Drop-seq experiment shownin 

Fig. 1a visualized on the respective ¢-SNE plot depicting n =3,179 mesenchymal 
cells. Mesenchymal cells were independently isolated from two groups of 
wild-type mice (biological replicates 1 and 2). From each of these isolations up 
to three independent Drop-seq samples were collected (A to C) fora total of 


five samples. f, All Ptgs2-expressing single cells (n =1,136) detected inthe 
experiment shown in Fig. la, c were analysed separately and re-clustered. 
Cluster annotations are visualized ona t-SNE plot. Violin plots display the entire 
distribution of gene expression levels per single cell in each cluster for key 
mesenchymal marker genes. F, fibroblasts. g, Schematic representation of the 
arachidonic acid metabolism pathway. For each mesenchymal cluster shown in 
Fig. 1a, violin plots display the entire distribution of gene expression levels per 
single cell for six genes involved in the metabolism of arachidonic acid to 
prostanoids. Data fromn=3,179 single mesenchymal cells are shown. 

h, Analysis of single-cell RNA-seq data (GSE11434) from the healthy human 
colonic mesenchyme’. Clustering results for n=4,348 cells and cluster 
annotations are visualized onat-SNE plot. The annotations of stromal 
populations are matched with the ones reported by Kinchen et al.? on the basis 
of the respective markers. Expression levels of PTGS2 per single cell are 
visualized on at-SNE plot. The entire range of gene expression levels per single 
cell for PTGS1, PTGS2 and key mesenchymal marker genes is displayed in violin 
plots. Data are mean+s.e.m.;ns, non-significant; *P< 0.05; **P<0.01. 
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Extended Data Fig. 2 | Location of major fibroblast populations inthe 
mouse intestine. a, Detection of Pdgfra-expressing mesenchymal cellsin the 
intestine of adult Pdgfra-H2B-eGPF-knockin mice’. Two distinct populations of 
Pdgfra™®" and Pdgfra'™ mesenchymal cells were detected in fixed tissue 
sections by direct eGFP fluorescence (green) and confocal microscopy. Nuclei 
are stained with DAPI (blue). Pdgfra"®" cells are located under the epithelium 
along the crypt-villus axis and in the muscularis propria. They form clusters at 
the tips of villiand the apical part of the colonic mucosa. Pdgfra'™ cells are 
located inthe inner part of the villi, the pericryptal area and the submucosa. 
Filled arrows indicate pericryptal Pdgfra'™ cells. Open arrows indicate 
subepithelial Pdgfra"®" cells. M, mucosa; V, villus; SM, submucosa; MP, 
muscularis propria. Scale bars, 20 um. Data are representative of six 
independent experiments. 

b, Detection of Pdgfra’*" and Pdgfra'™ fibroblasts in the fresh, intact intestine 
of adult Pdgfra-H2B-eGPF-knockin mice* by two-photon microscopy. The cells 
were detected by direct eGFP fluorescence (green). Pdgfra"*" cells are 
predominant in the muscularis propria, whereas Pdgfra'™ cells are 
predominant in the submucosa. Both populations are present in the mucosa. 
Data are representative of independent observations from one experiment. 
Scale bars, 100 pm. c, Detection of Pdgfra"*" and Pdgfra™ fibroblasts in the 
intestine of Pdgfra-H2B-eGPF-knockin embryos on embryonic day 15 (E15.0) 


and in early postnatal development. E15.0: clusters of Pdgfra"®" cells in early 
villi are indicated by white arrows. Pdgfra'” mesenchymal cells occupy the rest 
of the mesenchyme (asterisks). PO: Pdgfra"®" cells are observed in the villi (V) 
and Pdgfra’™ cells are observed both in the villi and in the rest of the 
mesenchyme (asterisks). P15: Pdgfra’™ cells surround an early crypt (C) and 
Pdgfra"®" cells are located at the edges of the crypt (open white arrows). 
Pdgfra' cells occupy the inner mesenchyme (asterisks). Data are 
representative of independent observations from one experiment per 
developmental stage. Scale bars, 20 um. d, The location of Fgfr2-expressing 
mesenchymal cells was determined in the intestine of an Fgfr2-T2A-H2B- 
mCherry-knockin mouse’, by detecting direct mCherry fluorescence (red) in 
the nucleus (blue, DAPI). The arrows indicate pericryptal Fgfr2° fibroblasts. 
Data are representative of independent observations from one experiment. 
Scale bars, 20 pm. e, Immunostaining for laminin Al (encoded by Lamaz1), the 
epithelial marker E-cadherin and the mesenchymal marker vimentin inthe 
normal mouse intestines shows that laminin Alis detected specifically 

at the mesenchymal-epithelial interface. Data are representative of two 
independent experiments. Scale bars, 5 um. f, In-situ hybridization analysis 
showing the location of Rspol-expressing cells inthe normal mouse colon. 
The position of Rspol-expressing cells along the crypt axis was quantified in 
40x 80 um? sub-epithelial areas at the base, middle and top sections of n=9 
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Extended Data Fig. 3 | See next page for caption. 
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Extended Data Fig. 3 | Mice with fibroblast-specific ablation or fibroblast- 
restricted expression of Cox-2. a, Immunofluorescence of ileum and colon 
sections from Col6-cre-Rosa26"** mice (scale bar, 20 1m) and of asmall 
intestinal tumour section from an Apc™"”*-Col6-cre-Rosa26""* mouse 
(scale bar, 150 pm). Data are representative of two experiments. b, Efficiency of 
Col6-cre-mediated recombination of alox-stop-lox tdTomato reporter in 
Pdgfra™®" and Pdgfra'™ Cd45 cells determined by flow cytometry inintestinal 
mesenchymaland lamina propria cells isolated from the small intestine and the 
colon of Col6-cre-Rosa26@"°" Pdg fra’ mice in one experiment. c, Ptgs2 
relative gene expression (RE) in whole tissue, isolated IECs, FACS-sorted 
Col6Cre* fibroblasts (CD45 tdTomato’*) and Col6Cre’ mesenchymal cells 

(CD45 tdTomato ) from the small intestine of Col6-cre-Rosa26@""* mice 
(n=3, pooled). Representative of two experiments. d, Efficiency of Col6-cre- 
mediated Ptgs2 gene ablation in Col6-Cre* mesenchymal cells determined by 
RT-qPCR analysis of Ptgs2 expression in FACS-sorted Col6-cre* fibroblasts 
(eGFP*) from the small intestine of Col6-cre-Rosa26""”""°Ptgs2”* (n=3) and 
Col6-cre-Rosa26™""°Ptgs2 (n=3) mice. Unpaired two-tailed Welch's t-test. 

e, Expression of the Prgs2 gene in whole tissue ileum of littermate Ptgs2” and 
Ptgs2**"*' mice (n=7 each). Two-tailed t-test. f, Spleen weight of 5.5-month-old 
Apc™"*Ptgs2 (n= 8) and Apc™"* Ptgs2“"'>' (n= 6) mice. Average spleen weight of 
(n=6) normal littermates (Ptgs2”) is displayed for comparison. Two-tailed t- 
test. g, Survival analysis of Ape™””“Ptgs2” (n=12) and Apc™*Ptgs2“*""" (n=12) 
mice. A two-tailed P= 0.00009687 was calculated by log-rank test. h, Size of 
274 adenomas from 5.5-month-old Apc™"*Ptgs2” (n=16) and Apc™™*Ptgs2“Fer 
(n=18) mice. The whiskers extend from minimum to maximum and the box 


extends from the 25th to 75th percentiles with the median indicated. Two- 
tailed Mann-Whitney test. i, Generation of knockin mice bearing alox-stop-lox 
cassette insertion in intron-3 of the Ptgs2 gene which prevents its expression 
(Ptgs2°"). Col6-cre-mediated excision of the lox-stop-lox cassette reactivates 
Ptgs2 expression specifically in fibroblasts (Ptgs2"’’), The orange box depicts 
an frt site remaining from the flp-mediated removal of an frt-flanked PGK- 
neomycin selection cassette (see Methods). j, Ptgs2” (n=30) and Ptgs2“*" 
(n=24) mice were subjected to 10 weekly intraperitoneal injections with 

10 mg kg‘ azoxymethaneas displayed. Quantification of the number of 
dysplastic fociand microadenomas per mouse and quantification of tumour 
size is shown. Statistical significance was tested by two-tailed Mann-Whitney 
test. k, Quantification of intestinal epithelial populations inthe ileum of 
littermate Ptgs2” and Ptgs2“""" mice (n=3-5 per genotype). Immunostaining 
was performed for markers of Paneth cells (lysozyme), tuft cells (Dclk1), 
enteroendocrine cells (chromogranin A) and stem cells (Olfm4). Goblet cells 
were identified by periodic acid Schiff (PAS) staining and enterocytes were 
identified by detecting alkaline phosphatase enzymatic activity. Incorporation 
and immunohistochemical detection of BrdU was used to determine the 
numbers of cycling cells. Data for each mouse represent mean number of 
positive cells per crypt or crypt-villus unit as indicated. N=400-822 crypts 
and/or villi were evaluated per staining. Statistical comparisons were 
performed with two-tailed unpaired t-test except for Olfm4* cells for which 
unpaired t-test with Welch’s correction was applied. Scale bars, 50 pm. All data 
represent mean +s.e.m. unless otherwise indicated. ns, non-significant; 
*P<0.05,**P<0.01, ***P<0.001. 
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Extended Data Fig. 4| See next page for caption. 
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Extended Data Fig. 4| PGE,-driven spheroids contain more functional stem 
cells. a, Cryptsisolated from the small intestine of wild-type mice were grown 
into organoids by 3D culture with OGM or OGM that was supplemented daily 
with 0.1 1M16,16-dimethyl PGE, (dmPGE,). Indicative images and 
quantification of the absolute numbers of organoids and spheroids grown per 
3D structure are shown. n=6 3D cultures were evaluated per condition. Scale 
bar, 100 pm. b, Assessment of stem-cell activity in organoids or PGE,-driven 
spheroids grownas ina by dissociation into single cells and 3D culture in OGM. 
Growth of crypts and organoids from the same initial number of cells was 
quantified on day 14. The results are indicative of five independent 
experiments starting from independent crypt isolations. c, Normal crypts 
were grown into organoids with OGM ina3D co-culture with primary mouse 
intestinal fibroblasts with or without 10 pM ONO-AE3-208 (Ptger4/EP4 
inhibitor). Indicative images and quantification of the absolute numbers of 
organoids and spheroids grown per 3D structure are shown. n= 63D co- 
cultures were evaluated per condition. Scale bar, 200 pm. d, Separation of 
n=2,192 fibroblasts and epithelial cells in single-cell RNA-seq data from 
fibroblast-crypt organotypic cultures on the basis of the expression of key 


marker genes. Expression of intestinal epithelial marker genes (Epcam, Atp1b1 
and Krt8) and fibroblast marker genes (Sparc, Colla1 and Col3a1) in single cells 
from Ptger4-ON and Ptger4—OFF fibroblast-crypt co-cultures is shown 
projected onto¢-SNE plots. e-j, Single-cell data from Ptger4-ON and Ptger4— 
OFF fibroblast-crypt co-cultures as shown in Fig. 3d, visualized on the 
respective t-SNE plot depicting n =1,585 epithelial cells. e, Expression of 
epithelial population-specific signatures (metagenes) per single epithelial cell. 
Population signatures were calculated on the basis of single-cell profiling of the 
mouse intestinal epithelium”. f, Cell cycle analysis of single epithelial cells 
projected onto the ¢-SNE plot. g-j, Expression levels of metagenes for the 
signatures or transcriptional programs of RSC’® (g), B-catenin (h), Yap (i) and 
early (non-tumour) Apc“"* tumorigenesis (j) per single epithelial cell 
projected onto¢-SNE plots. k, Data from n=1,585 single epithelial cells, 
visualized in violin plots for each co-culture condition (Ptger4-ON or Ptger4- 
OFF). The entire range of metagene expression levels per single epithelial cell 
for the signatures or transcriptional programs of RSCs, B-catenin, Yap and early 
Apc™"* tumorigenesis is displayed. Ina, c, two-way ANOVA. Dataare 

mean +s.e.m.****P<0.0001. 
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Clusters 


Relative expression to B2m 


Relative expression to B2m 


(2 “Deltacty 


(2 DeltaCty 


b Ptger1 Ptger2 
ek 
Ptger2 305 ; a 
0.005 ** 
Ww Ww 
0.004 x 204 + wv 10] 2 
t N P=0.199 
0.003 8 - go — 
& x 
0.002 107), « Be : 
— az 
0.001 | a £ ° 
al ale Fw a eer oe 
o.oo LI ee Pe ee ak ee - ee 
EFF FCO ws fF AGS 
s ee SAK FOE ¢ eS ¢ rs ¢ RS ¢ RS 
FY Oe a 
s Sor 4 
re Small Colon Small Colon 
Intestine Intestine 
Ptger4 
re ig Ptger4 
* 
2.5 
0.004 ° 
0.003: 2.0 elo 
7] WwW WwW 
0.002 i oc Cast , 
im i 2 = 
9.001 & S 1.0 ° 
! Qa Q 
coool th lee Pes os] + 
EFF GW OcF ws FS dF & aller 
eve A NN Ww A wes Rs - Fe 
a Paes & Re 
FE PLS OS 0-9 0.0-——_+—_.—_ 
¢ * (eu > eP & (a > OP > 
g < < & & 
¢ 3 ¢ 3 x7 3 x7 
Small Colon Small Colon 
Intestine Intestine 
d 
PTGER1 PTGER2 PTGER3 PTGER4 
p=0.1339 p= 0.0360 p= 0.0424 p<0.0001 
5 25 10 40 
a o ° LA 
* 20 8 Ld 
4 
i 30 
3 s 15 6 s 
S a < es = < 20 
oa o 5 oa a 
“2 40 “4 Tm 
a e 
e % 10 
5 2 Ly 
agi sy 
a 
| 0 »| > Pg] te 
~ Re IN < > < 
a o oO o © 
SS & ce Sy 
we «cS we ws we “wy 
Ptger4 expression per cell population Ptger4 expression 
per condition 
an 3 
3 1s) 
2 
2 2 
£2 7) 
g o 
= Q 
ie} c 
3 4 2 
g a 
a oO 
& ik | = 
0 a 
Cluster: 1 2 3 4 5 6 7 8 9 101112131415 16 17 18 19 20 21 22 > > 
Condition: @ @@S@@eS8G88 86086 3 ee oe & 
Se S 
a ~ 
No 8 Se SS OCS 
? & CF SK SF IH 
oe © of g gs “s & & 
¢ > & AW 
RY S 
ee ¢Y 


Article 


Extended Data Fig. 5| Expression of PGE, receptors in mouse and human 
tissues. a, RT-qPCR analysis for Ptgerl, Ptger2, Ptger3 and Ptger4 genes across 
12 mouse tissues. Expression relative to B2mis displayed as 2“. Data 
represent one experiment. MLN, mesenteric lymph nodes. b, RT-qPCR analysis 
for Ptger1, Ptger2, Ptger3 and Ptger4 genes in isolated IECs and matched stromal 
fractions from the small intestine (ileum) and the colon of wild-type mice 
(n=4). Statistical comparisons were performed by two-tailed paired t-test. 

c, Expression levels of Ptgerl, Ptger2, Ptger3 and Ptger4 genes determined by 
RNA-seq in FACS-sorted intestinal epithelial cell populations in 2 or 3 biological 
replicates and displayed as FPKM (fragments per kilobase of transcript per 
million mapped reads). Data retrieved from the GSE83394 GEO dataset. 

d, Expression levels of the human PTGERI, PTGER2, PTGER3 and PTGER4 genes 
in matched normal colon and tumour tissues from colorectal cancer patients 


(n=41), determined by RNA-seq and displayed as FPKM. Data retrieved from 
The Cancer Genome Atlas for colon adenocarcinoma (TCGA-COAD dataset). 
Statistical comparisons were performed by two-tailed Wilcoxon matched-pairs 
signed-rank test. e, Analysis of single-cell RNA-seq data’® (GSE117783) from 
crypts isolated from the small intestine of normal mice (blue) and mice treated 
with 12 Gy irradiation (red). n= 6,644 single cells are visualized on ¢-SNE plots 
based onthe experimental condition (normal, n=2,882; irradiated, n=3,762) 
and the clustering results. Violin plots represent the entire distribution of 
Ptger4 expression levels per single cell in each cluster and in each condition. 
The annotations of epithelial populations are matched with the ones reported 
by Ayyaz et al.’° on the basis of the respective markers. b-d, Mean+s.e.m.; ns, 
non-significant; *P< 0.05; **P<0.01. 
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Extended Data Fig. 6| See next page for caption. 
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Extended Data Fig. 6| PGE,-Ptger4 drive the induction of Yap target genes 
in intestinal organoids. a, Volcano plot displaying the results of differential 
gene-expression analysis performed in single epithelial cells from Ptger4-ON 
and Ptger4—OFF fibroblast-crypt co-cultures (n=1,585). Yap1 and Yap target 
genes" are indicated. Moderated t-test with false-discovery rate (Benjamini- 
Hochberg) correction. b, Expression levels of the genes indicated in single 
epithelial cells from Ptger4-ON and Ptger4—OFF fibroblast-crypt co-cultures 
(n=1,585), projected onto ¢-SNE plots. c, Experimental setup for data shownin 
d,e. Crypts were grown into organoids or spheroids by 3D culture in OGM or 
OGM that was supplemented daily with 0.1 11M dmPGE, for 7 days. Gene 
expression levels were measured by RT-qPCR on day 7. d, Relative expression of 
Yap target genes (Lyéa, Clu, Ilirn, Msin and Cxcli6) in day 7 organoids and 
PGE,-driven spheroids developed from wild-type crypts. V=3 3D cultures per 
condition. Two-tailed Welch’s f-test. e, Relative expression of Yap target genes 
in day 7 organoids and PGE,-driven spheroids developed from crypts isolated 
from Ptger4” and Ptger4*"* mice. n=3 3D cultures per genotype and condition. 
One-way ANOVA. f, Correlation between the expression levels of metagenes of 
a Yap transcriptional program and an early (non-tumour) Apc™"* 
tumorigenesis transcriptional program in single epithelial cells (n=1,585) from 


the Ptger4-ON and Ptger4—-OFF fibroblast-crypt co-cultures of Fig. 3. g, Small 
intestinal crypts were grown into organoids or spheroids with OGM or OGM 
that was supplemented daily with 0.1114M dmPGE,. Western blot analysis for 
Yapland B-actin was performed in total lysates from untreated organoids, 
organoids treated with 0.14M dmPGE, for 16h and untreated spheroids. 

Data from one organoid and three independent spheroid cultures. h, Relative 
expression of the Yap1 gene and Yap target genes in wild-type organoid 
cultures treated with 0.1 uM dmPGE, for 13h, as determined by RT-qPCR. 
n=3-5 cultures per condition. Statistical comparisons were performed with 
unpaired two-tailed ¢-test. For Ly6a, Welch’s correction was applied. i, Western 
blot analysis for Serl27 pYap and total Yap performed in total lysates from wild- 
type organoids stimulated with 0.1 4M dmPGE, for the indicated time-points. 
Indicative of five independent experiments.j, Relative expression of Yap target 
genes in wild-type organoids treated with1 uM verteporfin and 0.114M dmPGE, 
for 13h.n=3-4 cultures per condition. Statistical comparisons were 
performed with unpaired two-tailed t-test, two-tailed Welch’s t-test or Mann- 
Whitney test on the basis of the criteria described in Methods. All data are 
mean +s.e.m. *P< 0.05, **P< 0.01, ***P< 0.001, ****P< 0.0001. 
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Extended Data Fig. 8 | See next page for caption. 


Extended Data Fig. 8| Ptger4 ablation does not affect epithelial lineage 
differentiation and stem-cell function. a, Ptger4 gene expressionincrypts 
isolated from the ileum of littermate Prger4” and Ptger4“"* mice (n=3 mice per 
genotype) andin organoids grown from these crypts (n=3 cultures per 
genotype) determined by RT-qPCR analysis. Two-tailed unpaired ¢-test. 

b, Single-cell RNA-seq (Drop-seq) was performed in crypt epithelial cells 
isolated from littermate Prger4” and Ptger4“"“ mice. Data for 2,439 single 
epithelial cells are shown inat-SNE plot. c, Biological replicates visualized ona 
t-SNE plot. Crypt epithelial cells were independently isolated from two groups 
of mice per genotype (biological replicates 1 and 2). From the first biological 
replicate, two independent Drop-seq samples were collected (A and B) fora 
total number of three samples per genotype. d, Clustering and cluster 
assignments of 2,439 single epithelial cells displayed onat-SNE plot. 

e, Proportion of each epithelial cluster among total crypt epithelial cellsin 
Ptger4” and Ptger4“" mice. f, Violin plots showing the entire range of 
expression levels for a metagene of the B-catenin transcriptional programin 
n=2,439 single epithelial cells from Prger4” and Ptger4*"* mice. g, Analysis of 
all Ptger4-expressing single cells detected (n= 478). Re-clustering results of 
Ptger4-expressing single cells with cluster annotations are visualized ona t-SNE 
plot. The expression levels of key marker genes for these clusters are visualized 


ont-SNE plots. h, Lineage tracing of Ptger4 heterozygous (Ptger4-HET) and 
Ptger4-knockout (Ptger4-KO) Lgr5* stem cells. The small intestines of 
Lgr5-creERT2-Rosa26"*°* Ptger4"* (Ptger4-HET) and Lgr5-creERT2- 
Rosa26t™°/* ptger4” (Ptger4-KO) mice were examined for direct tdTomato 
fluorescence 5 days after a single injection of 2 mg tamoxifen per mouse. The 
results shown are representative of independent observations from one 
experiment. Scale bars, 70 um. i, Quantification of intestinal epithelial 
populations in the ileum of littermate Prger4” (n=5) and Ptger4“"* (n=5) mice. 
Immunostaining was performed for markers of Paneth cells (lysozyme), tuft 
cells (Dclk1), enteroendocrine cells (chromogranin A) and stem cells (Olfm4). 
Scale bars, 20 pm. Goblet cells were identified by PAS staining and enterocytes 
were detected by alkaline phosphatase enzymatic activity. Scale bars, 50 um. 
Incorporation and immunohistochemical detection of BrdU was used to 
determine the numbers of cycling cells. Scale bars, 50 tm. Data for each mouse 
represent mean number of positive cells per crypt or crypt-villus unit as 
indicated. n=217-565 crypts and/or villi were evaluated per staining. Statistical 
comparisons were performed with two-tailed unpaired t-test except for PAS* 
cells, for which unpaired Welch's f-test was applied. Mean+s.e.m.; ns, non- 
significant; **P< 0.01. 
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Extended Data Fig. 9 | See next page for caption. 


Extended Data Fig. 9 | Nuclear localization of Yap and activation of Yap 
target genes in Apc” and azoxymethane-induced tumorigenesis. 

a, Immunostaining for Yap in the small intestine of five-month-old Apc” and 
wild-type littermate control mice. Nuclear localization of Yap is displayed on 
the basis of colocalization with DAPI. Normal (N) and tumour (T) areas of the 
Apc™"” intestine are indicated. Scale bars, 70 um. Dataare indicative of at least 
ten different tumour areas. b, Immunostaining for Sca-1and the epithelial 
marker E-cadherin in normal and tumour areas of the small intestine of 
five-month-old Apc™”” mice. Indicative of two independent experiments. 

c, Immunostaining for Yap in the colon of wild-type mice subjected to 10 weekly 
intraperitoneal injections with 10 mg kg ‘azoxymethaneas indicated and in 
untreated controls. Nuclear localization of Yap is displayed on the basis of 
colocalization with DAPI. Scale bars, 20 tm. Data indicative of three mice 
analysed. d, Relative expression of the Yap target gene Clu in normal and 


tumour areas of the colon of wild-type mice (n= 8) subjected to 10 weekly 
intraperitoneal injections with 10 mg kg tazoxymethaneas showninc. 
Two-tailed Mann-Whitney test. e, Relative expression of Yap1 and Yap target 
genes in the small intestine of 5-week-old Ptger4” (n=3), Apc" Ptger4” (n= 6) 
and Apc™"”* Ptger4"* (n= 8) mice. Statistical comparisons were performed with 
two-tailed t-test for Yap1 and /lIrn and with two-tailed Mann-Whitney test for 
Ly6a.f, Spleen weight of 5.5-month-old Ape™“Ptger4” (n=8) and 
Apc™"Ptger4“¢ (n=7) mice. Average spleen weight of (n=2) normal 
littermates (Ptger4”) is displayed for comparison. Two-tailed t-test. g, Size of 72 
adenomas from5.5-month-old Apce™“Ptger4” (n= 6) and Apc“ Ptger44*¢ 
(n=4) mice. The whiskers extend from minimum to maximum and the box 
extends from the 25th to 75th percentiles with the median indicated. Two- 
tailed Mann-Whitney test. Mean +s.e.m.; ns, non-significant; *P< 0.05; 
**P<0.01. 
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Extended Data Fig. 10|PGE,-PTGER4 controlsstem-cellfunctioninhuman _ localization of Yap is displayed on the basis of colocalization with DAPI. Clearly 
colonic crypts and YAP displays a nuclear localization in human colorectal defined normal (N) and tumour (T) areas are indicated wherever applicable. 


tumours. a, Human colonic crypts were grown into organoids by 3D culture Images shownare representative of specimens obtained and analysed from 
with OGM or OGM supplemented daily with 0.1 4M dmPGE,, with or without n=16 patients with the types of colorectal tumours indicated. Patient 

10 pM ONO-AE3-208 (PTGER4-EP4 inhibitor). Images indicative of three characteristics and the type of colorectal tumour per individual are described 
independent experiments with crypts isolated from three patients are shown. inthe Supplementary Table 3. c, Schematic representation of the mechanism 
Scale bar, 100 pm. b, Immunostaining for YAP in sections of human colorectal proposed inthe present study. TISC, tumour-initiating stem cell. 


adenomas or adenocarcinomas and neighbouring normal tissue areas. Nuclear 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
Lt AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Imaging data were acquired with the Volocity software (PerkinElmer) and the Leica Application Suite LAS V2.7 software. Mass 
Spectrometry data were collected with the Agilent MassHunter Workstation software, version B.07.00. Real-time PCR data were acquired 
and with the CFX Manager software (Bio-Rad). Flow cytometry data were acquired with the FACSDiva 7 software. 


Data analysis Imaging data were analyzed with the Volocity software (PerkinElmer) and the Leica Application Suite LAS V2.7 software. Adenoma size 
was measured in pictures obtained from H/E sections with the using the ImageJ software or the Leica Application Suite LAS V2.7 Mass 
Spectrometry data were analyzed with the Agilent MassHunter Workstation software, version B.07.00. Real-time PCR data were analyzed 
with the CFX Manager software (Bio-Rad). Relative gene expression was calculated with the RelQuant software (Bio-Rad Laboratories). 
Flow cytometry data were analyzed with the FlowJo V10 software. For statistical analyses and graphs we used GraphPad Prism 7.01 and 
R. scRNAseq data was analysed with Seurat, Bowtie-2, Picard, GSVA, the Limma R package and custom code which will be available upon 
request. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


All data that support the findings of this study are available from the authors upon request. 
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Drop-seq data have been deposited in the Gene Expression Omnibus (GEO) repository with the accession code GSE142431. 
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Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 
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Sample size No statistical methods were used to predetermine sample size for experimentation. For in vivo experiments sample size was determined 
based upon the availability of mice and was the largest possible and consistent with the known literature. For in vitro experiments with 
organoids n = 3-5 independent cultures were used per experiment which was sufficient because of the minimal variation observed and 
consistent with the known literature 


Data exclusions No data were excluded from our analyses 
Replication All experiments were independently reproduced as stated in the figure legends and all attempts of replication were successful. For 
organoid experiments in mice and humans independent experiments refer to fully independent cultures starting from different mice or 


different patients respectively 


Randomization There was no need for randomization in our experiments. Mice were analyzed based upon their genotype and no mice were excluded from 
the analyses. 


Blinding Macroscopic tumor count and all histological analyses (counting of microadenomas, assessment of phenotype after irradiation) were 
performed in a blinded manner. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
| | Palaeontology [| MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Phospho-YAP (Ser127), clone D9W2lI, Cell Signaling 13008, Ref 5/18 - lot: 5 and Ref 10/17 - lot: 5, dilution 1:1000 (Western blot); 
YAP, clone D8H1X, Cell Signaling 14074, dilution 1:50 (IHC-P), 1:1000 (Western blot); 
TBP, clone D5C9H, Cell Signaling 44059, Ref 4/18 - lot: 1, dilution 1:1000 (Western blot); 
beta-actin, clone C4, Santa Cruz sc-47778, lot D0615, dilution 1:2000 (Western blot); 
Akt (pan), clone C67E7, Cell Signaling 4691, Ref 1/19 - lot: 20, dilution 1:1000 (Western blot); 
Vimentin-Alexa Fuor 647 rabbit monoclonal, clone D21H3, Cell Signaling 9856, Ref 8/19 - lot 13, dilution 1:800 (IHC-P), 1:200 (IF); 
COX2 rabbit polyclonal, Cayman 160126, lot 0482857-1, dilution 1:150 (IHC-P); 
GFP-Alexa Fuor 488 rabbit polyclonal, Thermo Fisher A-21311, dilution 1:200 (IF); 
CD45.2- Pacific Blue™ rat monoclonal, clone 104, Biolegend 109820, lot: B249623, dilution 1:200 (flow); 
Lysozyme-FITC, rabbit polyclonal, DAKO EC 3.2.1.17, dilution 1:100 (IHC-P); 
Dclk1-Alexa Fuor 647 rabbit monoclonal, clone EPR6085, Abcam ab202755, lot GR229365-2, dilution 1:400 (IHC-P); 
Chromogranin A rabbit polyclonal, Abcam ab15160, lot ZZG021907A, dilution 1:300 (IHC-P); 
Olfm4 rabbit monoclonal, clone D6Y5A, Cell Signaling 39141, Ref 12/18 — lot: 1, dilution 1:300 (IHC-P); 
Ly6a/Sca-1-Alexa Fuor 647 rat monoclonal, clone E13-161.7, Biolegend 122517, lot: B249605, dilution 1:400 (IF); 
Ly6a/Sca-1-PE-Cy7 rat monoclonal, clone D7, Biolegend 108114, lot: B154904, dilution 1:100 (flow); 
Cd24 Brilliant Violet 421™ anti-mouse monoclonal, clone M1/69, Biolegend 101825, dilution 1:200 (flow); 
Laminin A1 rat monoclonal, clone AL-4, R&D MAB4656, dilution 1:50 (IHC-P); 


7 12q012Q 


810 


E-cadherin-FITC mouse monoclonal, clone 36/E-Cadherin, BD 612130, lot 8152975, dilution 1:200 (IHC-P) 


Validation All antibodies have been validated by the manufacturer for the species and the application to be used for as described in the 
data sheets provided. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals We used the Mus musculus C57BL6 strain (laboratory mouse). The generation of Ptgs2 lox-stop-lox knockin mice is described in 
this publication. Ptgs2 lox-stop-lox mice were generated and provided by Harvey R. Herschman, Col6Cre mice were provided by 
George Kollias, Ptgs2f/f mice were provided by Harvey R. Herschman, Ptger4f/f mice were provided by Richard M. Breyer, 
Fgfr2mCherry mice were provided by Philippe Soriano, ApcMin/+ mice, VillinCre mice, Lgr5-EGFP-IRES-creERT2 mice, 
Rosa26tdTomato/+ mice (Ai14) and PdgfraEGFP/+ were purchased from the Jackson Laboratories. All mice compared in 
experiments were littermates, co-housed and sex matched. Both male and female mice were used. Experiments were performed 
with 8-12 week-old mice unless otherwise stated in the figure legends (tumor experiments at 5 weeks and 5.5 months). 


Wild animals The study did not involve wild animals 
Field-collected samples The study did not involve samples collected from the field 
Ethics oversight All animal experimentation at Yale was performed in compliance with Yale Institutional Animal Care and Use Committee 


protocols. Experiments in BSRC “Alexander Fleming” were approved by the Institutional Committee of Protocol Evaluation in 
conjunction with the Veterinary Service Management of the Hellenic Republic Prefecture of Attika according to all current 
European and national legislation. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Human subjects were both men (60%) and women (40%), aged between 23 and 87 years old (average 51.5 years old). Detailed 
human subject characteristics are provided in the Supplementary Table 3. 


Recruitment All human intestinal tissue samples (fresh or formalin-fixed paraffin-embedded) were derived from patients who underwent 
surgery at the Yale-New Haven Hospital for the conditions described in the Supplementary Table 3. Tissues were obtained 
anonymously from the surgical pathology services of the hospital (Yale Pathology Archives) based upon availability without bias 
in terms of sex and age 


Ethics oversight Yale Human Investigation Committee protocols #0304025173 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Clinical data 


Policy information about clinical studies 


All manuscripts should comply with the ICMJE guidelines for publication of clinical research and a completed CONSORT checklist must be included with all submissions. 


Clinical trial registration Provide the trial registration number from ClinicalTric NV or an equiv 
Study protocol Note where the full trial protocol can be accessed OR if not available, explair 
Data collection 
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Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 
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Methodology 


Sample preparation For intestinal lamina propria/mesenchymal cell isolation the intestine was dissected, flushed, opened longitudinally and then cut 
into 1 cm pieces. The tissues were incubated in HBSS containing 1 mM EDTA, 1 mM DTT, 0.2 % FBS, 4-5 times, 10 min each, at 37 
°C, 200 rpm. Epithelial cells were depleted by vigorous shaking. After epithelial cell removal the tissues were processed were 
incubated in DMEM 10% FBS containing Collagenase XI (300 units/ml, Sigma, C7657), Dispase I! (0.1 mg/ml, Sigma, D4693) and 
DNase II Type V (50 units/ml, Sigma, D8764) for 1 h, at 37 oC, 200 rpm. Cells released after vigorous shaking were passed 
through a 70 UM strainer and washed with 2% sorbitol. 

For organoid samples organoids were dissociated into single cell suspensions by incubation at 37 °C in 0.25% trypsin-EDTA 
solution (Gibco, 25200056) diluted 1:1 with DMEM without serum. 


Instrument FACS-sorting was performed at the Yale Flow Cytometry Facility with a BD FACSAria Il sorter equipped with FACSDiva 7 software 
and with a BD FACSAria III sorter equipped with FACSDiva software at the Flow Cytometry Facility of BSRC Fleming. 


Software Data were acquired with the BD FACSDiva 7 software and analyzed with the FlowJo V10 software. 


Cell population abundance _ All sorted cells per population were used for RNA isolation 
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Gating strategy Extended Data Figure 3c: FSC-A, SSC-A live cells >> SSC-W, SSC-H singlets >> FSC-H, FSC-W singlets >> Cd45-Tomato+, Cd45- 
Tomato- 
Extended Data Figure 3d: FSC-A, SSC-A live cells >> SSC-W, SSC-H singlets >> FSC-H, FSC-W singlets >> Cd45->>Tomato-GFP+ 
Extended Data Figure 3b: FSC-A, FSC-H singlets >> FSC-A, SSC-A live cells >> Cd45-Pacific blue-, Pdgfra-EGFP+ >> Col6Cre 
tdTomato 
Extended Data Figure 7b: FSC-A, FSC-H singlets >> FSC-A, SSC-A live cells >> FSC, Cd24+ >> FSC, Sca-1 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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The gluconeogenic enzyme PCK1 
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Cancer cells increase lipogenesis for their proliferation and the activation of sterol 
regulatory element-binding proteins (SREBPs) has a central role in this process. 
SREBPs are inhibited by acomplex composed of INSIG proteins, SREBP 
cleavage-activating protein (SCAP) and sterols in the endoplasmic reticulum. 
Regulation of the interaction between INSIG proteins and SCAP by sterol levels is 
critical for the dissociation of the SCAP-SREBP complex from the endoplasmic 
reticulum and the activation of SREBPs’”. However, whether this protein interaction is 
regulated by a mechanism other than the abundance of sterol—and in particular, 
whether oncogenic signalling has a role—is unclear. Here we show that activated 

AKT in human hepatocellular carcinoma (HCC) cells phosphorylates cytosolic 
phosphoenolpyruvate carboxykinase 1 (PCK1), the rate-limiting enzymein 
gluconeogenesis, at Ser90. Phosphorylated PCK1 translocates to the endoplasmic 
reticulum, where it uses GTP as a phosphate donor to phosphorylate INSIGI at Ser207 
and INSIG2 at Ser151. This phosphorylation reduces the binding of sterols to INSIG1 
and INSIG2 and disrupts the interaction between INSIG proteins and SCAP, leading to 
the translocation of the SCAP-SREBP complex to the Golgi apparatus, the activation 
of SREBP proteins (SREBP1 or SREBP2) and the transcription of downstream 
lipogenesis-related genes, proliferation of tumour cells, and tumorigenesis in mice. 
In addition, phosphorylation of PCK1 at Ser90, INSIGI at Ser207 and INSIG2 at Ser151is 
not only positively correlated with the nuclear accumulation of SREBP1in samples 
from patients with HCC, but also associated with poor HCC prognosis. Our findings 
highlight the importance of the protein kinase activity of PCK1 in the activation of 
SREBPs, lipogenesis and the development of HCC. 


® Check for updates 


INSIG proteins are anchor proteins of the endoplasmic reticu- 
lum (ER) that have two isoforms, INSIG1 and INSIG2?*. The bind- 
ing of cholesterol-derived oxysterols, including 22-, 24-, 25- and 
27-hydroxycholesterol, is crucial for the binding of INSIG proteins to 
SREBP cleavage-activating protein (SCAP), and for the retention of 
the SREBP-SCAP complex in the ER®*°. Under sterol-limiting condi- 


cancer cells, whether the interaction between INSIG proteins and the 
SREBP-SCAP complex is regulated by a mechanism that is independent 
of sterol abundance is unclear. 


AKT-phosphorylated PCK1 binds to INSIG1/2 


tions, the SREBP-SCAP complex is captured by COPII-coated vesicles 
and transported to the Golgi apparatus, where S1P and S2P proteases 
cleave SREBPs (SREBP1 or SREBP2) to yield active amino-terminal 
fragments for nuclear translocation and gene transcription’. In 


To investigate the regulation of INSIG proteins by oncogenic signal- 
ling, we treated Huh7 human HCC cells with insulin-like growth factor 
1(IGF1) for 1h to induce the signalling that is critical for HCC devel- 
opment’. Mass spectrometric analyses of immunoprecipitates of 
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INSIG1 and INSIG2 (Extended Data Fig. la, Supplementary Tables 1, 2) 
showed that IGF1 induced an association between INSIG1 or INSIG2 
(hereafter, INSIG1/2) and cytosolic PCK1 (also known as PEPCK1). PCK1 
is the rate-limiting enzyme of gluconeogenesis in the liver and kidney, 
and converts oxaloacetate and GTP into phosphoenolpyruvate and 
Co,°. Inhumans, cytosolic PCK1 shares 63.4% sequence identity with 
mitochondrial PCK2°. Co-immunoprecipitation and immunofluores- 
cence analyses showed that PCK1, but not PCK2, bound to INSIG1/2 
in IGF1-stimulated Huh7 and Hep3B HCC cells (Fig. 1a, Extended Data 
Fig. 1b, c), and that IGF1 induced the colocalization of PCKI1, but not 
PCK2, with INSIGI (Extended Data Fig. 1d). In addition, cell fractionation 
analyses showed that a small amount of PCK1, but not PCK2, translo- 
cated to the ER (Extended Data Fig. 1e, f). Thus, PCK1 translocates to 
the ER and binds to INSIGI/2 upon IGF1 stimulation. 

To determine the mechanism that regulates the interaction between 
PCK1and INSIG1/2, we inhibited the signalling pathway downstream of 
IGFlin Huh7 cells (Extended Data Fig. 1g). We found that treatment with 
the AKT inhibitor MK-2206 or expression of a dominant-negative AKT 
mutant (AKT-DN) blocked the IGF1-induced binding of PCK1to INSIG1/2 
(Fig. 1b, Extended Data Fig. 1h), the translocation of PCK1 to the ER 
(Extended Data Fig. 1i, j) and its colocalization with INSIG1 (Extended 
Data Fig. 1k). By contrast, expression of a constitutively active form of 
AKT (myr-AKT) induced the binding of PCK1, but not PCK2, to INSIGI/2 
(Extended Data Fig. 11). These results indicate that AKT induces the 
translocation of PCK1 to the ER, where it binds to INSIG1/2. 

Co-immunoprecipitation analyses showed that stimulating HCC cells 
with IGF1 induced an interaction between AKT and PCK1 (Extended 
Data Fig. Im). We also performed a pull-down assay that revealed that 
purified active glutathione S-transferase (GST)-tagged AKT1 bound 
directly to His-tagged PCK1 (Extended Data Fig. 2a) through its catalytic 
domain (as evidenced by the expression of different AKT1 truncation 
mutants; Extended Data Fig. 2b). An in vitro phosphorylation assay 
(Extended Data Fig. 2c) and liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) analysis (Extended Data Fig. 2d) showed 
that active, but not inactive, AKT1 phosphorylated PCK1 at the evo- 
lutionally conserved residue Ser90 (Extended Data Fig. 2e) inan AKT 
phosphorylation motif RXXS/T” (Extended Data Fig. 2f). Mutation of 
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Fig. 1|IGF1-induced and AKT-phosphorylated PCK1 translocates tothe ER 
and binds to INSIG1/2. a—e, Immunoprecipitation (IP) and western blotting 
(WB) analyses were performed as indicated three times with similar results. 

a, Huh7 cells were treated with or without IGF1 for 1h. pindicatesa 
phosphorylated residue. b, Huh7 cells expressing Flag—PCK1 were pretreated 
with or without the indicated inhibitors for 30 min before treatment with or 
without IGF1 for 1h.c, In vitro kinase assays were performed by mixing purified 
wild-type (WT) His-PCK1 or His-PCK1(S90A) with or without purified GST- 
AKTl1inthe presence of [y-*P]ATP. d, Huh7 cells pretreated with or without 
MK-2206 for 30 min were treated with IGF1 for the indicated time periods. 

e, Parental Huh7 cells and the indicated clones with knock-in expression of 
PCK1(S90A) were stimulated with or without IGF1 for 1h. C1, clone 1; C2, clone 2. 


Ser90 in PCK1 to alanine (S90A) (Fig. 1c) or treatment with MK-2206 
(Extended Data Fig. 2g) abolished this phosphorylation, which was 
also detected using an antibody that is specific to PCK1 phosphoryl- 
ated at Ser90 (PCK1(pS90)) (Extended Data Fig. 2c, h). Treatment with 
IGF1 rapidly induced the phosphorylation of PCK1in Huh7 cells, and 
this was inhibited by pretreating cells with the AKT inhibitor MK-2206 
(Fig. 1d). An RNA-interference-resistant (r) PCK mutant (rPCK1(S90A)) 
expressed in endogenous PCK1-depleted Huh7 cells was resistant to 
IGF1-induced phosphorylation of PCK1 Ser90 (Extended Data Fig. 2i) 
and did not exhibit myr-AKT1- or IGF1-induced translocation tothe ER 
(Extended Data Fig. 2j, k). By contrast, the phosphorylation-mimicking 
PCK1(S90E) mutant accumulated in the ER without IGF1 stimulation 
(Extended Data Fig. 2k). Knock-in expression of PCK1(S90A) in HCC 
cells using CRISPR-Cas9-mediated genome editing (Extended Data 
Fig. 2I-n) blocked the IGF1-induced translocation of PCK1to the ER and 
its colocalization with INSIG1 (Extended Data Fig. 20, p). These results 
suggest that AKT1-mediated phosphorylation of PCK1 Ser90 is neces- 
sary and sufficient for the translocation of PCK1 to the ER. 

Of note, AKT-mediated phosphorylation of PCK1 (Extended 
Data Fig. 2q-s) or the introduction of an S90E mutation (Extended 
Data Fig. 2t-v) reduced the binding affinity of PCK1to oxaloacetate and 
its enzymatic activity (that is, the production of phosphoenolpyruvate). 
Thus, AKT-mediated phosphorylation of PCK1 and its translocation 
to the ER inhibit the canonical function of PCK1 in gluconeogenesis. 

To determine the role of the phosphorylation of PCK1 Ser90 in 
its binding to INSIG1/2, we mixed purified INSIG1/2 with wild-type 
GST-PCK1 or GST-PCK1(S90A). Only AKT-phosphorylated wild-type 
PCK1interacted with INSIG1/2 (Extended Data Fig. 2w). This interaction 
was abolished by treatment with calf intestinal alkaline phosphatase 
(CIP), which dephosphorylated PCK1(pS90) residue. Consistent with 
this, treatment with IGF1 induced the binding of INSIG1/2 to S pro- 
tein-Flag-streptavidin-binding-peptide (SFB)-tagged wild-type PCK1, 
but not SFB-PCK1(S90A) (Extended Data Fig. 2x). This binding was 
abrogated by treatment with CIP (Extended Data Fig. 2x) or by knock-in 
expression of PCK1(S90A) in HCC cells (Fig. le, Extended Data Fig. 2y). 
Thus, phosphorylation of PCK1 Ser90 is required for the binding of 
PCK1to INSIG1/2. 

To determine the role of INSIG1/2 in the translocation of PCK1tothe 
ER, we depleted INSIGI/2 in Huh7 cells, and found that this blocked the 
translocation of PCK1 to the ER that was induced by IGF1 or by myr-AKT1 
(Extended Data Fig. 2z). Expression of INSIG1/2 truncation mutants 
revealed that loop 1 of INSIG1/2 bound to PCK1 (Extended Data Fig. 3a). 
This interaction was not blocked in a PCK1(pS90) peptide (Extended 
Data Fig. 3b), which suggests that it is a conformational change in PCK1 
(mediated by phosphorylation at Ser90) that causes the binding of 
PCK1 to INSIGI/2, rather than the phosphorylation of Ser90 directly. 


PCK1 phosphorylates INSIG1/2 


AsPCK1is able to transfer a phosphate group from GTP toa metabolite, 
we next investigated whether PCK1 phosphorylates INSIG1/2. In the 
presence of radiolabelled [y-P]GTP and active AKTI, only purified 
wild-type PCK1 and not PCK1(S90A) or PCK1(C288S)—a kinase-dead 
mutant that could still be phosphorylated at Ser90 and interacted 
with INSIG1/2 (Extended Data Fig. 3c)—phosphorylated purified INSIG1 
(Fig. 2a) and INSIG2 (Extended Data Fig. 3d). By contrast, this phospho- 
rylation did not occur in the presence of [y-*P]ATP (Extended Data 
Fig. 3e). LC-MS/MS analyses showed that INSIG1 was phosphorylated at 
Ser207 (Extended Data Fig. 3f), which corresponds to Ser151 in INSIG2. 
Both of these residues are evolutionally conserved and are located inthe 
cytosolic loop 2 of INSIG1/2 (Extended Data Fig. 3g, h). INSIG1(S207A) 
(Fig. 2b) and INSIG2(S151A) (Extended Data Fig. 3i) mutants were not 
phosphorylated by PCK1 in vitro (as detected by an antibody that 
specifically recognizes both phosphorylated residues; Extended Data 
Fig. 3j, k). Compared to wild-type PCK1, PCK1(S90E) exhibited a much 


Nature | Vol580 | 23 April 2020 | 531 


Article 


a c Huh7 oS Hep3B ¢ 
, GST-AKT1 - - + + + + O* pe! J J 
= < ee pee ie PY ie 

eo cos) pa arerGsen His-PCK1WT + + - - + + INSIGV2 WS _gSlP gsi ys gsi os 

Se ociclicosas SFB-INSIG1 + - + - + - IGED Sab ep A 

is es oa esata SFB-INSIG1(S207A) - + - + - + iz = 
Se Giaiiads cides ce = 2) WB: INSIG1(pS207)/INSIG2(pS151) 

22P-INSIG1 a| @r eee weeereewe 
~ WB: INSIG1 
S2P-INSIG1 WB: INSIG1(p$207)/INSIG2(p$151) % = a 
6 Sa = ececcecee Q WB: INSIG1(p$207)/INSIG2(p$151) 
WB: PCK1(pS90) < 
a — WB: Flag (SFB-INSIG1) a| @e ewer ewreeece 
— —_ eecae > WB: INSIG2 
: His (| ) WB: GST (AKT1) - - - - - - 
ese © se® @ =e ea : 
: a WB: PCKi(pS90) 
WB: GST (AKT1) WB: PCK1 =| seaeea 
eweerece pi eecesoe ee 
WB: Flag (SFB-INSIG1) NS WB: FCK1 
d = we e e2e2eeee owwewws 
GST-AKT1 - + - + - + 222,, NS WB: Tubulin NS 
GST-PCK1(C288S) - - + + - - SBE °) . GST-AKT1 - - + + 3 — 
GST-PCK1WT + + - - - - Bos , GST-PCK1WT + + + + 6 3B 15) NS 
GST-PCK1(S90A) - - - - + + Soe * Flag/His-INSIG1 WT + - + - See ila : 
Flag/His-INSIG1 + + + + + + @ § 205 Flag/His-INSIG1(S207A) - + - + 2s. 
g w= & ge 0 <s = £ Ss 0.5 
5 S| WB: INSIG1(pS207/ oa E S| we: INSIG1(ps207)/ ass ? 
BZ} INSIG2(pS151) 50 3s] INSiG2(ps151) Eas 
Bs 
5 O|| = Flag/His- 37 “| _@e@e@e 50 
WB: INSIG1 INSIG] <i WB: INSIG1 37 
a ee es -- mMNSIGI 5p 
: GST-AKT1 - + - + - + : 25 
WB: PCK1(PS90)_ st-pcKi(c2888)  - - + + - — poe) GST-AKT - - + + 
Input|| << GST-PCKIWT +4 -- - - [as GST-PCK1WT + + + + 
WB: PCK1 GST-PCK1(S90A)_ - - - - + + sae) Flag/His-INSIG1 WT + - + - 
-= © @ Flag/His-INSIGT + + + + + + 7 Flag/His-INSIG1(S207A)  - + - + 
WB: AKT WB: AKT 


Fig. 2|PCK1 phosphorylates INSIG1 Ser207 and INSIG2 Ser151, thereby 
reducing the binding of oxysterols to INSIGI/2. a—e, Immunoblotting 
analyses were performed as indicated three times with similar results. 

a,b, Bacterially purified His-PCK1 proteins on Ni-NTA agarose beads were 
incubated with or without active GST-AKT1in the presence of ATP for an 

in vitro kinase assay. The beads were then washed and incubated with or 
without the indicated SFB-INSIGI proteins in the presence of [y-*P]GTP. 
Autoradiography was performed. c, Parental Huh7 and Hep3B cells and the 
indicated clones with knock-in expression of INSIG1(S207A)/INSIG2(S151A) 


higher velocity of enzyme-catalysed reaction at infinite concentration 
of substrate (V,,,,,) and lower Michaelis constant (K,,) in phosphoryl- 
ating an INSIG1 peptide at Ser207 (Extended Data Fig. 31). Notably, 
PCK1(S90E) or PCK1(S90D) mutants phosphorylated Ser207 of INSIG1 
and Ser151 of INSIG2 inthe absence of AKT (Extended Data Fig. 3m, n). 
In addition, INSIG1(S207A) and INSIG2(S151A), expressed in Huh7 cells, 
were resistant to phosphorylation that was induced by IGF1 stimula- 
tion or myr-AKT1 expression (Extended Data Fig. 30, p). In line with 
this, IGF1-induced phosphorylation of INSIG1/2 was also abolished 
by knock-in expression of PCK1(S90A) (Extended Data Fig. 3q), or by 
that of INSIGI(S207A)/INSIG2(S151A) (Fig. 2c, Extended Data Fig. 4a-d) 
in HCC cells. Thus, AKT-phosphorylated PCK1 functions as a protein 
kinase and uses GTP asa phosphate donor to phosphorylate INSIG1/2. 


INSIG1/2 phosphorylation reduces sterol binding 

We next examined whether phosphorylation of INSIGI/2 affects the 
binding of INSIG proteins to oxysterols. Phosphorylation of wild-type 
INSIG1 and INSIG2 by AKT-phosphorylated wild-type PCK1 (Fig. 2d, 
Extended Data Fig. 4e), PCK1(S90D) or PCK1(S90E) (Extended Data 
Fig. 4f, g)—but not PCK1(C288S) or PCK1(S90A)—reduced the bind- 
ing affinity of INSIG1/2 to [7H]25-hydroxycholesterol. By contrast, 
the binding affinity was unchanged for INSIG1(S207A) (Fig. 2e) and 
INSIG2(S151A) (Extended Data Fig. 4h). Similarly, wild-type INSIG1/2— 
but not INSIGI(S207A) or INSIG2(S151A) (Extended Data Fig. 4i,j)—that 
was immunoprecipitated from IGF1-treated parental Huh7 cells showed 
areductionin binding to [*H]25-hydroxycholesterol, whereas the bind- 
ing affinity of wild-type INSIG1/2 that was immunoprecipitated from 
Huh7 cells with knock-in expression of PCK1(S90A) was not affected 
(Extended Data Fig. 4k, |). This reduction in binding also occurred for 
the phospho-mimicking mutants INSIG1(S207E) and INSIG2(S151E) 
(Extended Data Fig. 4m, n). These results indicate that phosphoryla- 
tion of INSIG1/2 by PCK1 reduces the binding of INSIGI/2 to oxysterols. 
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double mutants (2S/A) were stimulated with or without IGF1 for 1h. d,e, Flag 
and His (Flag/His)-tagged wild-type INSIGI (d) or INSIG1(S207A) (e) 
immunoprecipitated and purified from Huh7 cells were incubated with the 
indicated GST-PCK1 proteins with or without active GST-AKT1 in the presence 
of ATPand GTP for Lh. The INSIG proteins on Ni-NTA agarose beads were 
washed and incubated with 400 nM [7H]25-hydroxycholesterol. Specifically 
bound [>H]25-hydroxycholesterol was measured (right) (n= 6). Dataare 

mean ts.d.NS, not significant (P=0.859, 0.930, 0.795, 0.768 (left to right) (d); 
P=0.720, 0.630 (left to right) (e)); **P< 0.001 (two-tailed t-test). 


The PCK1-INSIG1/2 axis activates SREBP 


The release of oxysterols from INSIGI/2 results in the disruption of 
the interaction between INSIGI/2 and SCAP, the translocation of 
the SCAP-SREBP1 complex from the ER to the Golgi apparatus and 
nuclear accumulation of SREBPI°. As expected, stimulation with IGF1 
disrupted the association between INSIG1/2 and SCAP (Fig. 3a, Extended 
Data Fig. 5a). This disruption was inhibited in Huh7 and Hep3B cells 
with knock-in expression of INSIG1(S207A)/INSIG2(S151A) (Fig. 3a, 
Extended Data Fig. 5a) or PCK1(S90A) (Extended Data Fig. 5b, c), or 
with reconstituted expression of rPCK1(S90A) (Extended Data Fig. 5d, 
e) or rPCK1(C288S) (Extended Data Fig. 5f). Cell fractionation analyses 
showed that IGF1 treatment induced the translocation of SCAP from 
the ER to the Golgi apparatus, and that this was inhibited by knock-in 
expression of INSIG1(S207A)/INSIG2(S151A) (Extended Data Fig. 5g, h) 
or PCK1(S90A) (Extended Data Fig. 5i, j). Expression of these mutants 
also blocked the IGF-induced loss of colocalization of SCAP with the 
ER protein calnexin (Fig. 3b, Extended Data Fig. 5k, n) and the colocali- 
zation of SCAP with the Golgi apparatus protein golgin-97 (Extended 
Data Fig. 51, m, 0). Similar results were obtained when we expressed 
Myc-tagged SCAP (Extended Data Fig. 5p, q). 

Stimulation with IGF1 increased the cleavage of SREBP1 (Extended 
Data Figs. 5r, 6a) and its nuclear accumulation (Fig. 3c, Extended Data 
Fig. 6b); SRE-promoter-driven luciferase activity (Extended Data 
Figs. 5s, 6c); and the expression of mRNA and proteins of SREBP1 tar- 
get genes that are associated with lipogenesis, including fatty acid 
synthase (FASN), acetyl-CoA carboxylase alpha (ACACA, also known 
as ACC1), stearoyl-CoA desaturase (SCD, also known as SCD1) and 
glycerol-3-phosphate acyltransferase 1 (GPAM, also known as GPAT), 
as well as SREBF1 (which encodes SREBP1)'’” (Fig. 3d, Extended 
Data Figs. 5t, 6d-f). Consequently, treatment with IGF1 resulted in 
increased incorporation of “C-glucose into triglycerides and fatty 
acids (Fig. 3e, Extended Data Fig. 6g). Notably, all of these IGF1-induced 
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Fig. 3 | PCK1-mediated phosphorylation of INSIG1/2 releases SCAP from 
the ERand promotes SREBP1 activation for lipogenesis. a—c, Parental Huh7 
cells and INSIG1(S207A)/INSIG2(S151A) knock-in cells were transfected 

with the indicated plasmids and stimulated with or without IGF1 for 16h. 
Immunoprecipitation and immunoblotting analyses (a) and 
immunofluorescence analyses (b, c) were performed as indicated. The arrows 
in bindicate Golgi-localized SCAP. The experiments were repeated three times 


effects were inhibited by knock-in expression of INSIG1(S207A)/ 
INSIG2(S151A) (Fig. 3c-e, Extended Data Figs. 5r-t, 6f) or PCK1(S90A) 
(Extended Data Fig. 6a—g). Furthermore, the IGF1-induced incorpora- 
tion of “C-glucose into fatty acids was strongly blocked by expression 
of PCK1(S90A) in CHL-1 melanoma cells, U87 glioblastoma cells and 
H1993 non-small-cell lung cancer cells (Extended Data Fig. 6h), suggest- 
ing that the lipogenesis that is promoted by phosphorylation of PCK1 
Ser90 occurs in several types of cancer. Consistently, knock-in expres- 
sion of PCK1(S90D) (Extended Data Figs. 2I-n, 6i) or INSIG1(S207D)/ 
INSIG2(S151D) (Extended Data Figs. 4a—d, 6j) in HCC cells was sufficient 
to induce SREBPI cleavage. 

Similar to our observations with SREBPI1, expression of PCK1(S90A) 
or INSIG1(S207A)/INSIG2(S151A) also inhibited IGF1-induced cleav- 
age of SREBP2 (Extended Data Fig. 6k, 1) and SREBP2-mediated tran- 
scription of genes that are related to cholesterol biogenesis”, suchas 
3-hydroxy-3-methylglutaryl-CoA (HMGC) reductase (HMGCR), HMGC 
synthase 1 (HMGCS1, also known as HMGCS), low-density lipoprotein 
receptor (LDLR) and squalene synthase (FDFT1, also known as SS) 
(Extended Data Fig. 6m, n). Thus, PCK1-mediated phosphorylation 
of INSIG1/2 promotes the activation of both SREBP1 and SREBP2, and 
the expression of downstream genes that are involved in lipogenesis. 

Because PCK1-mediated phosphorylation of INSIG1/2 promotes 
the release of sterols from their binding to INSIG1/2, we postulated 
that this phosphorylation does not affect the activation of SREBP1 
that is induced by lipid depletion. As expected, knock-in expression 
of INSIG1(S207A)/INSIG2(S151A) or PCK1(S90A) did not affect the 
lipid-depletion-induced translocation of SCAP from the ER to the Golgi 
apparatus (Extended Data Fig. 60-r), loss of colocalization of SCAP with 
calnexin (Extended Data Fig. 6s—u) and colocalization of SCAP with 
golgin-97 (Extended Data Fig. 6v-x), the cleavage and nuclear accumu- 
lation of SREBP1 (Extended Data Fig. 7a, b), and SRE-driven luciferase 
activity (Extended Data Fig. 7c). By contrast, incubation of HCC cells 
with 25-hydroxycholesterol—but not cholesterol—at a dosage (120 nM) 
much higher than physiological concentrations (40 nM)" blocked the 
dissociation of INSIG1/2 from SCAP, the cleavage of SREBP and the 
increase of SRE luciferase activity that were induced by IGF1 (Extended 
Data Fig. 7d, f) or by expression of phosphorylation-mimicking PCK1 
and INSIG1/2 mutants (Extended Data Fig. 7e, g). These results dem- 
onstrate that PCK1-mediated phosphorylation of INSIG1/2 promotes 


independently with similar results. d, The mRNA expression levels of SREBP1 
target genes were measured using quantitative PCR. *P=0.009, 0.002 (left to 
right); **P< 0.001 (two-tailed t-test). e, The incorporation of “C-glucose into 
triglycerides (TGs) (left) and fatty acids (FAs) (right) was measured. *P=0.01, 
0.03 (left to right); **P < 0.001 (two-tailed t-test). Dataind,e are mean+s.d. 
(n= 6 biological replicates). 


SREBP1 activation through the release of 25-hydroxycholesterol from 
INSIG1/2. 

SREBP1 can be activated both in a manner that depends on AKT 
and mTORCI, and independently of AKT and mTORC1™. It has been 
previously shown that insulin signalling and AKT activation sup- 
press the expression of INSIG2 by promoting the decay of its mRNA”. 
Inaddition, INSIG1, but not INSIG2, is ubiquitinated and degraded upon 
sterol depletion’®. A time-course experiment showed that INSIG2 was 
not obviously downregulated until prolonged treatment with IGF1 
(24-48 h) (Extended Data Fig. 7h), and this downregulation was not 
affected in PCK1(S90A) or INSIG1(S207A)/INSIG2(S151A) mutants 
(Extended Data Fig. 7i). By contrast, treatment of HCC cells with IGF1 
and cycloheximide (CHX)—which eliminates the translational regula- 
tion of protein expression—induced rapid degradation of wild-type 
INSIG1, but not INSIG1(S207A), wild-type INSIG2 or INSIG2(S151A) 
(Extended Data Fig. 7j, k). In addition, the half-life of INSIG1(S207D), 
but not INSIG2(S151D), was shortened (Extended Data Fig. 71, m), indi- 
cating that phosphorylation of INSIG1Ser207 promotes the degrada- 
tion of the INSIG1 protein. However, the overall expression of INSIG1 
inthe presence of IGFl and absence of CHX was not obviously altered 
(Extended Data Fig. 7h). This might be a result of an increase in /NS/G1 
transcription owing to SREBP activation”, which compensates for the 
degradation of INSIG1in a feedback manner. In contrast to the delayed 
response of INSIG2 expression to IGF1 stimulation, immediate phos- 
phorylation of PCK1 and INSIG1/2 (Extended Data Fig. 7n) and rapid 
cleavage of SREBP1 (Extended Data Fig. 7h) were detected after IGF1 
treatment. Thus, PCK1-mediated rapid phosphorylation of INSIG1/2 
and activation of SREBP1 is an immediate response to AKT activation. 

De novo fatty acid synthesis occurs in the liver, adipose tissue and 
lactating breast?°. Treatment of HL7702 and THLE-2 normal hepato- 
cytes with IGF1 induced phosphorylation of PCK1 Ser90, subsequent 
phosphorylation of INSIG1 Ser207 and INSIG2 Ser151, and SREBP1 
cleavage (Extended Data Fig. 8a). The latter effect was also induced 
by expression of PCK1(S90D) in these cells (Extended Data Fig. 8b), 
which suggests that PCK1-induced activation of SREBP1 also occurs 
innormal hepatocytes. 

Under normal physiological conditions, high blood glucose levels 
after a meal increase the pancreatic secretion of insulin, which imme- 
diately activates the phosphoinositide 3-kinase (PI3K)-AKT signalling 


Nature | Vol580 | 23 April 2020 | 533 


Article 


pathway and leads to an increase in glucose utilization and a reduction 
in gluconeogenesis in the liver”. Of note, phosphorylation of AKT, PCK1 
Ser90, INSIGI Ser207 and INSIG2 Ser151, and cleavage of SREBP1, were 
markedly enhanced in normal liver from mice that were refed with 
glucose after 24 h of fasting (Extended Data Fig. 8c)—suggesting that 
in vivo blood glucose levels regulate the PCK1-mediated phosphoryla- 
tion of INSIG1/2 and the activation of SREBP1in the liver. These results 
demonstrate the relevance of our findings in the context of the physi- 
ological functions of the liver. 

We then examined a potential difference in the PCK1-mediated activa- 
tion of SREBP1 between normal hepatocytes and HCC cells. We found 
that there was a substantial increase in the phosphorylation of AKT, 
PCK1 Ser90, INSIG1 Ser207 and INSIG2 Ser151—and in the cleavage 
of SREBP1—in HCC cells compared with normal human hepatocytes 
(Extended Data Fig. 8d). These increases were reduced by expression 
of PCK1(S90A) in HCC cells (Extended Data Fig. 8e), indicating that 
PCK1-mediated SREBP1 activation is increased in HCC cells witha high 
level of AKT activation. 

Notably, these increases in the phosphorylation of AKT, PCK1 and 
INSIG1/2, and in the cleavage of SREBP1, were further enhanced or 
induced in Huh7, CHL-1, U87 and H1993 cancer cells that express 
the active KRAS(G12V) mutant (Extended Data Fig. 8f, g), a mutant 
version of the IGF1 receptor (IGF1R) (IGFIR(V922E)) (Extended Data 
Fig. 8h) or a mutant version of the epidermal growth factor recep- 
tor (EGFR) (EGFRvIII) (Extended Data Fig. 8i), and in cells treated 
with platelet-derived growth factor (PDGF) (Extended Data Fig. 8)). 
Expression of PCK1(S90A) (Extended Data Fig. 8f-j) or INSIG1(S207A)/ 
INSIG2(S151A) (Extended Data Fig. 8f, h) inhibited INSIG1/2 phospho- 
rylation and SREBPI cleavage in these cells. These results suggest that 
PCK1-mediated activation of SREBP1is induced by AKT activation that is 
elicited by different oncogenes or growth factors and occurs in several 
types of cancer. 


The PCK1-INSIG1/2 axis promotes HCC growth 


Notably, reconstituted expression of PCK1(S90A) in different HCC 
celllines—which reduced the basal level of phosphorylation of INSIG1 
Ser207 and INSIG2 Ser151 and the cleavage of SREBP1 (Extended Data 
Fig. 8k)—inhibited the proliferation of cells under normal culture condi- 
tions (Extended Data Fig. 8m), without altering the cleavage of SREBP1 
and the decrease in cell survival induced by lipid depletion (Extended 
Data Fig. 81, n). Similarly, knock-in expression of INSIG1(S207A)/ 
INSIG2(S151A) or PCK1(S90A) inhibited the proliferation of Huh7 cells 
(Fig. 4a) and Hep3B cells (Extended Data Fig. 9a). By contrast, expres- 
sion of PCK1(S90D) or INSIG1(S207D)/INSIG2(S151D) mutants enhanced 
cell proliferation (Extended Data Fig. 9b, c). 

Next, we intrahepatically (Fig. 4b) and subcutaneously (Extended 
Data Fig. 9d-f) injected Huh7 cells with or without knock-in expres- 
sion of PCK1(S90A) or INSIG1(S207A)/INSIG2(S151A) into nude mice. 
Expression of these mutant proteins substantially inhibited tumour 
growth in the mice (Fig. 4b, Extended Data Fig. 9d-f), with a corre- 
sponding reduction in Ki67 expression (Extended Data Fig. 9g) andan 
increase in cell apoptosis (Extended Data Fig. 9h). In addition, expres- 
sion of PCK1(S90A) reduced the phosphorylation of INSIG1 Ser207 
and INSIG2 Ser151, and reduced the expression and nuclear distribu- 
tion of SREBP1 (Fig. 4c, Extended Data Fig. 9, j, left). In line with this, 
expression of INSIG1(S207A)/INSIG2(S151A) also decreased the nuclear 
levels of SREBP1 (Fig. 4d, Extended Data Fig. 9i, j, right). By contrast, 
expression of PCK1(S90D) or INSIG1(S207D)/INSIG2(S151D) promoted 
tumour growth (Extended Data Fig. 9k-m). 

Consistent with the finding that AKT-PCK1 signalling activates 
SREBP1, expression of active IGFIR(V922E) (Extended Data Fig. 10a— 
c) or myr-AKT (Extended Data Fig. 10d-f) in Huh7 cells significantly 
increased the growth of liver tumours, with enhanced phosphoryla- 
tion of PCK1 Ser90 and INSIG1 Ser207/INSIG2 Ser151 and enhanced 
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Fig. 4| PCK1-mediated phosphorylation of INSIG1/2 promotes liver tumour 
growth and correlates with poor prognosis for HCC. a, Parental Huh7 cells 
(2x 105) and the indicated clones with knock-in expression of PCK1(S90A) (left) 
or INSIG1(S207A)/INSIG2(S151A) (right) were plated for 3 days. The cells were 
then collected and counted. **P< 0.001 (two-tailed t-test). b, Huh7 cells with or 
without knock-in expression of PCK1(S90A) (top) or INSIG1(S207A)/ 
INSIG2(S151A) (bottom) were intrahepatically injected into athymic nude mice 
(n=7 per group). Tumour growth was examined 28 days after injection. The 
arrows indicate tumours. Tumour volumes were calculated (right). *P=0.03, 
0.02 (left to right, top);*P= 0.02, 0.03 (left to right, bottom) (two-tailed ¢-test). 
Dataina, bare mean+s.d.c-e, IHC analyses of xenograft tumours from nude 
mice (n=7) (c,d) and 90 human HCC samples (e) were performed with the 
indicated antibodies. Representative staining images are shown. The regions 
in white boxes are shownat higher magnification below. f, Kaplan-Meier plots 
of the overall survival rates in 90 patients with HCC grouped according to high 
(staining score, 4-8) and low (staining score, 0-3) expression of PCK1(pS90) 
(top) and INSIG1(pS207)INSIG2(pS151) (bottom) (n= 38 (PCK1(pS90) low), 
n=52 (PCK1(pS90) high); n =39 (INSIG1(pS207)/INSIG2(pS151) low),n=51 
(INSIG1(pS207)/INSIG2(pS151) high). Pvalues were calculated using a log-rank 
test (two-tailed). 


nuclear expression of SREBP1. Knock-in expression of PCK1(S90A) or 
INSIG1(S207A)/INSIG2(S151A) inhibited both basal tumour growth 
and that induced by IGF1R(V922E) (Extended Data Fig. 10a—c) or 
myr-AKT (Extended Data Fig. 10d-f). By contrast, expression of a 
dominant-negative IGFIR(L1003R) mutant reduced both intrahe- 
patic (Extended Data Fig. 10g, h) and subcutaneous (Extended Data 
Fig. 10i-n) tumour growth, with reduced phosphorylation of PCK1 and 
INSIG1/2 (Extended Data Fig. 100) and reduced nuclear accumulation 
of SREBP1 (Extended Data Fig. 10p, q). The reduction in tumour growth 
(Extended Data Fig. 10g-n) and nuclear SREBP1 expression (Extended 
Data Fig. 10p, q) was partially reverted by expression of PCK1(S90D) 
or INSIG1(S207D)/INSIG2(S151D). 

Furthermore, we used hydrodynamics-based transfection in mice to 
administer plasmids for the expression of active myr-AKT, c-Met and 
the sleeping beauty transposase (which induces rapid liver tumour 
growth”) together with wild-type PCK1 or PCK1(S90A). Expression of 
PCK1(S90A) reduced tumour growth, Ki67 expression, INSIGI/2 phos- 
phorylation and nuclear SREBP1 expression (Extended Data Fig. 10r-t). 
Thus, PCK1-mediated phosphorylation of INSIG1/2 and subsequent 
activation of SREBP1 promote the development of HCC. 

To determine the clinical relevance of PCK1-regulated SREBP1 activa- 
tion, we performed immunohistochemistry (IHC) analyses of 30 paired 


samples of primary HCC and adjacent normal tissue. Phosphorylation 
of PCK1Ser90 and INSIG1 Ser207/INSIG2 Ser151—and nuclear SREBP1 
expression—were markedly increased in the HCC specimens compared 
with normal tissue (Extended Data Fig. 1Ou-w), and correlated with 
each other in 90 resected HCC tumours (Fig. 4e, Extended Data Fig. 10x, 
y). Notably, in HCC samples, high levels of phosphorylation of PCK1 
Ser90 and INSIGI Ser207/INSIG2 Ser151—and high levels of nuclear 
SREBP1 expression—were correlated with decreased overall durations 
of survival in patients with HCC (Fig. 4f). These results suggest that 
PCK1-mediated phosphorylation of INSIGI/2 has a critical role in the 
clinical aggressiveness of human HCC. 

Metabolism and gene expression are two fundamental cellular pro- 
cesses that are essential for the proliferation of tumour cells and that 
can be mutually regulated”’. PCK1 was originally characterized as a 
gluconeogenesis enzyme. Herein we have shown that PCK1 has protein 
kinase activity and translocates to the ER to regulate SREBP1 activation 
and SREBP1-mediated gene expression (Extended Data Fig. 10z). We 
also demonstrated that a metabolic enzyme uses GTP, rather than ATP, 
as a phosphate donor to phosphorylate a protein substrate. In addi- 
tion, we have shown that oncogenic signalling can rapidly modulate 
the association between INSIG1/2 and sterol without reducing total 
cellular sterol levels. Our findings highlight the potential for inhibi- 
tion of the protein kinase activity of PCK1 as a treatment strategy in 
human HCC. 
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Methods 


Materials 

Normal mouse IgG (sc-2025), normal rabbit IgG (sc-2027), GST 
(sc-138), tubulin (sc-8035), ERK1/2 (sc-514302) and INSIG1 (A-9) (sc- 
390504) (for immunoprecipitation or immunoblotting) antibodies 
were obtained from Santa Cruz Biotechnology. PCK1 (16754-1-AP) 
(for immunoprecipitation or immunoblotting), INSIG1 (22115-1-AP) 
(for immunofluorescence or immunoblotting) and INSIG2 (24766-1-AP) 
(for immunoprecipitation or immunoblotting) antibodies were pur- 
chased from Proteintech. Anti-AKT(pS473) (4060), rabbit AKT (9272), 
mouse AKT (2920), c-SRC (2108), haemagglutinin (HA) (3724), p44/42 
MAPK (ERK1/2) (9102), c-Jun (9165), c-Jun(pS73) (3270), Myc-tag (9B11) 
(2276) and PCK1 (D12F5) (12940) antibodies (for immunoblotting) 
were purchased from Cell Signaling Technology. Rabbit antibodies 
that recognize PCK1(pS90), INSIG1(pS207) and INSIG2(pS151) were 
obtained from Signalway Biotechnology. IGF1 (85580C), PDGF (P8147), 
cyclohexamide (CHX) (01810), oxaloacetate (07753), cholesterol 
(C8667), 25-hydroxycholesterol (H1015), mouse monoclonal anti-Flag 
(F1804), rabbit anti-Flag (F7425) and anti-His (SAB1305538) antibod- 
ies were purchased from Sigma-Aldrich. Anti-Flag M2 agarose beads 
were purchased from MP Biochemicals. [7H]25-hydroxycholesterol, 
[y-P]ATP and [y-”P]GTP were obtained from PerkinElmer. Inactive 
AKTI1 protein (14-279-D) and rabbit anti-Ki67 antibody (AB9260) were 
obtained from Millipore. Oxaloacetate assay kit (ab83428), anti-Myc 
tag rabbit (ab9106), calnexin (ab22595), SRC(pY418) (ab4816), SCAP 
(ab91323), golgin-97 (ab84340), SCD1 (CD.E10) (ab19862), FASN 
(ab22759) and anti-rabbit IgG heavy chain (HRP) (ab99702) antibod- 
ies were purchased from Abcam. U0126, SP600125, MK-2206 and 
SU6656 were purchased from EMD Biosciences. Active GST-AKT1 
(A16-10G) was obtained from SignalChem. Active recombinant 
His-AKT1 protein (LS-G18427) was obtained from Lifespan Biosciences. 
SREBP1 (IgG 2A4) and SREBP2 antibodies (557037) for immunoblotting 
analyses were purchased from BD Biosciences. SREBP1 antibody (2A4) 
(NB100-2215) (for immunofluorescence and IHC analyses) and PCK1 
mouse antibody (3E4) (HO0005105-M1) (for immunofluorescence or 
immunoblotting) were purchased from Novus. INSIG2 (PA5-41707) 
and INSIG1 (PAS-97876) antibodies (for immunoblotting), calnexin 
(AF18) (MA3-027), golgin-97 (CDF4) (A-21270), glutathione agarose, 4’, 
6-diamidino-2-phenylindole (DAPI), Alexa Fluor 488 goat anti-rabbit 
(A11008), Alexa Fluor 594 goat anti-rabbit (A11012), Alexa Fluor 
488 goat anti-mouse (A11029) and Alexa Fluor 594 goat anti-mouse 
antibodies (A11005) were obtained from Thermo Fisher Scientific. 
The phosphoenolpyruvate carboxykinase activity assay kit (K359) was 
purchased from BioVision. PCK1(pS90) peptide (DVARIE-pS-KTVIVT), 
INSIG1/2(pS207/S151) peptide (WWTFDR-pS-RSGLGL) and PCK1(S90) 
covering peptide (WWTFDRSRSGLGL) were synthesized by Selleck- 
Chem. CIP was obtained from New England BioLabs. Ni-NTA agarose 
was obtained from Qiagen. 


DNA construction and mutagenesis 

PCR-amplified human wild-type PCK1, PCK2, INSIG1 and INSIG2 and 
IGF1R were cloned into pcDNA3.1/hygro(+)-Flag, -HA, -His, or -Myc, 
pCDH-CMV-MCSEF1-Puro-SFB or pET32a vectors. SCAP was cloned 
into pcDNA3.1/hygro(+)-Myc. pECE-Myr-HA-AKT1(delta4-129), 
MSCV-XZ066-EGFR vIII and pBabe-puro-KRAS(G12V) were pur- 
chased from Addgene. EGFR vIII and KRAS(G12V) were cloned into 
pcDNA3.1/hygro(+)-Flag. Flag/His-double-tagged INSIGI1 and INSIG2 
were constructed in a pFastBacHTa expression vector (Invitrogen) as 
described”. pT3-EFla-c-Met was a gift from X. Chen (Addgene plas- 
mid 31784). pbCMV(CAT)T7-SB100 was a gift from Z. Izsvak (Addgene 
plasmid 34879). Flag—PCK1 (wild-type or S90A-mutant) and HA-myr- 
AKT were cloned into a pT3-EFla vector. Myc-SCAP was a gift from Y. 
Chen at Shanghai Institute of Biological Sciences, Chinese Academy 
of Sciences. 


pcDNA3.1 Flag-rPCK1, Flag-rPCK1(S90A), Flag-rPCK1(S90E), 
SFB-PCK1(S90A), pGEX-4T-1 PCK1(S90A), HA-AKT-DN (K179A, 
T308A, S473A), Flag-rPCK1(C288S), HA-rPCK1(C288S), PGEX-4T-1 
PCK1(C228S), pET22b PCK1(S90A), pET22b PCK1(C288S), SFB- 
INSIG1(S207A), SFB-INSIG2(S151A), Flag-IGFIR(V922E), Flag- 
IGFIR(L1003R), Flag/His-double-tagged INSIG1(S207A) and 
INSIG1(S207E), Flag-INSIG1(S207D), Flag/His-double-tagged 
INSIG2(S151A) or INSIG2(S151E), Flag—-INSIG2(S151D) and short hair- 
pin RNA (shRNA)-resistant PCK1 constructs containing nonsense 
mutations of G948A, T951G, C954T and A957C were constructed 
using a QuikChange site-directed mutagenesis kit (Stratagene). pGIPZ 
shRNA was constructed via ligation of an oligonucleotide targeting 
human PCK1 into an Xhol/Mlul-digested pGIPZ vector. The 
following pGIPZ shRNA target sequences were used: control 
shRNA oligonucleotide, 5’°-GCTTCTAACACCGGAGGTCTT-3’; PCK1 
shRNA oligonucleotide, 5’- TGTGCGTCAAACTTCATCC-3’. INSIGI 
shRNA oligonucleotides, 5’- TAATGGTGTCTATCAGTATAC-3’ and 
5’- GGAACATAGGACGACAGTTA-3’; INS/IG2 shRNA oligonucleotides, 
5’-CATCTAGGAGAACCTCATAAA-3’ and 5’- CTTCAGCTGTGATTGGGTT 
-3’; SCAP shRNA oligonucleotides, 5’- CTCTTCAGCTAT TACAACA -3’ 
and 5’- AGGAAGAGGATGGTCTCCT -3’. 


Cell lines and cell culture conditions 

Hep3B, Huh7, H1993, CHL-1, SNU-398, SNU-475, HL7702, THLE-2 and 
293T cells were from ATCC. The cells were maintained in complete 
medium containing Dulbecco’s modified Eagle’s medium (DMEM), 
10% fetal bovine serum (FBS), 1,000 U ml" penicillin and 100 pg mI 
streptomycin. Lipid-depleted medium contains DMEM supplemented 
with 5% LPDS and lovastatin (5 1M). Before IGF1 (100 ng mI”) treat- 
ment, the cells were serum-starved for 16 h. U0126 (20 uM), SU6656 
(4 uM), SP600125 (25 1M), or MK-2206 (10 tM) was added 30 min before 
treatment with or without IGF1 (100 ng mI) for 1h. No cell lines used in 
this study were found in the database of commonly misidentified cell 
lines that is maintained by the International Cell Line Authentication 
Committee and NCBI Biosample. Cell lines were authenticated by short 
tandem repeat profiling and were routinely tested for mycoplasma 
contamination at The University of Texas MD Anderson Cancer Center. 
Cells were plated at a density of 4 x 10° per 60-mm dish or 1 10° per well 
of a 6-well plate 18 h before transfection. The transfection procedure 
was performed as previously described’. 


Immunoprecipitation and immunoblotting analysis 

The extraction of proteins using a modified buffer from cultured cells 
was followed by immunoprecipitation and immunoblotting using cor- 
responding antibodies as described previously’”. 


GST pull-down assay 

Equal amounts of His-tagged purified protein (200 ng per sample) were 
incubated with 100 ng of GST fusion proteins together with glutathione 
agarose beads in a modified binding buffer (SO mM Tris-HCl at pH 7.5, 
1% Triton X-100, 150 mM NaCl, 1mM DTT, 0.5 mM EDTA, 100 pM PMSF, 
100 uM leupeptin, 1 uM aprotinin, 100 LM sodium orthovanadate, 
100 uM sodium pyrophosphate, 1mM sodium fluoride). The glutathione 
agarose beads were then washed four times with binding buffer and 
then subjected to immunoblotting analysis as previously described”. 


Purification of recombinant proteins 

Wild-type GST-PCK1, GST-PCK1(S90A), GST-PCK1(C288S), His-PCK1, 
His-PCK1(S90A), His-PCK1(C228S) were expressed in bacteria and 
purified as described previously”®. Flag/His-double-tagged INSIG1 and 
INSIG2 were purified as previously described>. 


[?H]25-hydroxycholesterol binding assay 
[?H]25-hydroxycholesterol binding reactions were performed as previ- 
ously described”. In brief, each reaction was performed inafinal volume 


of 100 pl of buffer A (SO mM Tris-HCl at pH 7.5, 150 mM NaCl, 1mM dithi- 
othreitol, 0.1% Fos-choline 13, 0.005% sodium azide). It contained 1.2 ng 
(400 nM) of purified wild-type or mutants of Flag/His-double-tagged 
INSIG1 or INSIG2, 10-500 nM [?H]25-hydroxycholesterol and 25 mM phos- 
phocholine chloride. After incubation for 4 hat room temperature, the 
mixture was passed througha column packed with 0.3 ml of Ni-NTA aga- 
rose beads (Qiagen). Each column was then washed 3 times with 10 ml of 
buffer B (SO mM Tris-HCl at pH 7.5, 150 mM NaCl, 1mM dithiothreitol and 
0.1% (w/v) Anapoe-C12E9). The protein-bound [?H]25-hydroxycholesterol 
was eluted with 250 mM imidazole and measured by scintillation 
counting. 


Cell fractionation 

The cell fraction assay was performed according toa previously reported 
method”. Inbrief,4010-cm dishes of parental Huh7 cells and the indicated 
clones of cells with INSIGI(S207A)/INSIG2(S151A) double knock-in expres- 
sion or PCK1(S90A) knock-in expression were stimulated with or without 
IGF1(100 ng mI) or incubated with or without lepid-depleted medium for 
16 h. The cells were then placed on ice and washed twice with PBS and 
homogenization buffer (10 mM triethanolamine-acetic acid, pH 7.4,0.25 M 
sucrose, 1mM sodium EDTA, protease inhibitor cocktail (Roche)). The 
washed cells were collected in 0.8 ml homogenization buffer and homoge- 
nized by passing through a 25-gauge needle ona1-ml syringe 13 times. After 
centrifugation at2,000gfor15 minat4 °C, the post-nuclear supernatant was 
collected andloaded on preformed iodixanol (Sigma-Aldrich) gradients. 
The discontinuous iodixanol gradients were prepared as 2.65 ml 
of 24%, 19.33%, 14.66% and 10%, which were made by diluting a 60% 
stock of iodixanol with cell suspension medium (0.85% (w/v) NaCl, 
10 mM Tricine-NaOH, pH 7.4). After standing at room temperature for 
2h, the gradients were then centrifuged at 37,000 rpm in a SW40Ti 
rotor (Beckman Instruments) for 4 h. The post-nuclear supernatant 
was loaded on the top of the gradients and centrifuged at 37,000 rpm 
for another 2 h. Deceleration was performed without a brake. 
A total of 15 fractions (800 pl per fraction) were collected from the top 
to the bottom, and the bottom two fractions containing aggregated 
material were not analysed further. Aliquots of each fraction were 
used for further analysis by immunoblotting. ER fractions were 
isolated from the cells using an Endoplasmic Reticulum Isolation 
Kit (ERO100, Sigma-Aldrich). ER proteins were used in immunoblot 
analyses. 


Conversion of radiolabelled glucose to triglycerides and fatty acids 
The incorporation of the various radioactive substrates into triglyc- 
erides and fatty acids was measured as previously described”®. In 
brief, cells were washed twice with PBS and incubated in 1 ml labelling 
medium (2.5% fatty-acid-free bovine serum albumin, 1% (v/v) penicillin/ 
streptomycin, 0.5 mM D-glucose, 0.5 mM sodium acetate, 2mM sodium 
pyruvate, 2 Ci ml'™C-U-glucose or “C-acetate) at 37 °C ina humidi- 
fied incubator (5% CO.) for 4.5 h before lipid extraction. All metabolic 
processes were stopped by washing cells twice with cold PBS and 
lysing them with the addition of modified Dole’s extraction mixture 
(80 ml isopropanol, 20 ml hexane, 2 ml 0.5 MH,SO,). Triglycerides were 
extracted with hexane and washed, and the solvent was evaporated. 
The incorporation of “C-glucose into fatty acids and triglycerides was 
determined by evaporating the solvent from neutral lipids, adding 1 ml 
of KOH-ethanol (20 ml 95% ethanol, 1 ml water, 1 ml saturated KOH) and 
heating samples to 80 °C for 1h. Sulfuric acid was added to the mixture 
to ensure complete saponification. Addition of hexane allowed for 
hydrophobic separation. The hydrophilic portion was evaporated and 
counted using liquid scintillation. Incorporation data were normalized 
according to cell number. 


Invitro kinase assay 
In vitro kinase assays were performed as previously reported”. 
In brief, for the AKT in vitro kinase assay, purified active GST-AKT 


(A16-10G, SignalChem) (500 ng) was incubated with bacterially purified 
His—PCK1 (200 ng) in 25 pl kinase buffer (SO mM Tris-HCl at pH 7.5, 
100 mM KCI, 50 mM MgCl, 1 mM Na,VO,, 1 mM DTT, 5% glycerol, 
0.5 mM ATP and 10 pCi [y-**P]ATP) at 25 °C for 1h. The reaction was 
terminated by adding SDS-PAGE loading buffer and heated at 100 °C 
for 5 min. The reaction mixture was then subjected to an SDS-PAGE 
analysis. For PCK1 in vitro kinase assay, bacterially purified wild-type 
His-PCK1 or His-PCK1(S90A) on the Ni-NTA agarose beads was incubated 
with or without GST-AKT1in the presence of ATP for 1h. After the in vitro 
AKT kinase assay, the Ni-NTA agarose beads were washed in PBS five times 
and incubated with or without SFB-tagged wild-type or mutant INSIGI/2 
using 50 pl kinase buffer in the presence of 0.5 mM GTP and 10 Ci [y-*P] 
ATP or [y-?P]GTP at 25 °C for 1h. Autoradiography was performed. 


Mass spectrometry analyses 

For identification of interacting proteins, a protein band visualized 
via Coomassie blue staining was excised from an SDS-PAGE gel and 
digested in gel in 50 mM ammonium bicarbonate buffer containing 
RapiGest (Waters Corporation) overnight at 37 °C with 200 ng of modi- 
fied sequencing-grade trypsin (Promega). The digested protein samples 
were analysed using high-sensitivity LC-MS/MS with an Orbitrap Elite 
mass spectrometer (Thermo Fisher Scientific). Proteins were identified 
by searching the fragment spectra against the UniProt protein database 
(EMBL-EBI) using the Mascot search engine (v.2.3; Matrix Science) with 
the Proteome Discoverer software program (v.1.4; Thermo Fisher Scien- 
tific). For detection of phosphorylation sites, in vitro phosphorylation 
of PCK1 by AKT and INSIG1/2 by PCK1 were performed according to the 
in vitro kinase assay protocol described above. Then the protein samples 
were digested using trypsin or chymotrypsin and analysed using LC-MS/ 
MS with the Orbitrap Elite mass spectrometer as described previously”. 


Determining the K,, of PCK1 

The K,, of PCK1 was determined using a GTPase assay kit (MAK113, 
Sigma-Aldrich) according to the manufacturer’s instructions. In brief, 
purified recombinant wild-type PCK1 (10 ng) was incubated in 100 pl of 
reaction buffer (40 mM Tris-HCl at pH 7.5, 80 mM NaCl, 10 mM synthetic 
INSIG1 peptide substrate (LWWTFDRSRSGLGLG), 8 mM magnesium 
acetate, 1mM DTT) with different concentrations of GTP at 37 °C in 
96-well plates. The plates were read by multi-detection microplate 
readers (BMG Labtech) at 620 nm in kinetic mode for 5 min. K,, and 
Vinax Were calculated from a plot of 1/V versus 1/[substrate] according 
to the Lineweaver-Burke plot model. 


Determination of oxaloacetate binding affinity and detection of 
PCK1activity 

Ni-NTA beads and purified recombinant wild-type His-PCK1, 
His-PCK1(S90E) or AKT-phosphorylated His—-PCK1 were incubated in 
200 pl binding buffer (SO mM Tris-HCl at pH 7.5, 100 mM KCI, 50 mM 
MgCl,, 50 mM MnCl, 1mM Na,VO,) containing 200 uM oxaloacetate 
(07753, Sigma-Aldrich) at 30 °C for 30 min. The oxaloacetate associated 
with His-PCK1 on the beads was then centrifuged at 12,000 rpm for 
10 min. The oxaloacetate remaining in the supernatant was quantified 
using an oxaloacetate assay kit (ab83428, Abcam). The oxaloacetate 
binding affinity was calculated according to the percentages of the 
remaining oxaloacetate in the supernatants. The activity of PCK1 was 
determined using a phosphoenolpyruvate carboxykinase activity assay 
kit (K359, BioVision). 


CRISPR-Cas9-mediated genome editing 

Genomic mutations were introduced into cells using the CRISPR- 
Cas9 system, as described previously*®. Single-guide RNAs (sgRNAs) 
were designed to target the genomic area adjacent to mutation sites 
in PCK1(S90A), INSIG1(S207A) and INSIG2(S151A) using the CRISPR 
design tool (http://crispr.mit.edu/). The annealed guide RNA oligonu- 
cleotides were inserted into a PX458 vector (Addgene) digested with 
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the Bbsl restriction enzyme”. Cells were seeded at 60% confluence, 
followed by co-transfection of sgRNAs (0.5 pg) and single-stranded 
donor oligonucleotide (10 pmol) as a template to introduce muta- 
tions. Twenty-four hours after transfection, cells were trypsinized, 
diluted for single cells and seeded into 96-well plates. GGnomic DNA 
was extracted from GFP-positive cells, followed by sequencing of the 
PCR products spanning the mutation sites. sgRNA targeting sequence 
for INSIG1(S207A): 5’-ACATTTGATCGT TCCAGAAG-3’; single-stranded 
donor oligonucleotide (ssODN) sequence for INSIG1(S207A): 
5’-AAAT TGGAT TT TGCCAATAATGTCCAGCTGTCCTTGACTT TAGCAGCC 
CTATCTTTGGGCCTT TGGTGGACAT TTGATCGcgCCAGgAGcGGCCTT 
GGGCTGGGGATCACCATAGCTTT TCTAGCTACGCTGATCACGCAGTTT 
CTCGTGTATAATGGTGTCTATCA-3’; ssODN sequence for INSIG1 
(S207D): 5’-AAAT TGGAT TT TGCCAATAATGTCCAGCTGTCCTTGAC 
TTTAGCAGCCCTATCTT TGGGCCT TTGGTGGACATTTGATCGcgaCAGgA 
GcGGCCTTGGGCTGGGGATCACCATAGCTT TTCTAGCTACGCTGATCAC 
GCAGTTTCTCGTGTATAATGGTGTCTATCA-3’;sgRNA targeting sequence 
for INSIG2(S1S1A):5’-ACTTT TGATAGATCTAGAAG-3’;ssODN sequence for 
INSIG2(S151A): 5’°-AAAGTGGATT TCGATAACAACATACAGT TGTCTCTC 
ACACTGGCTGCACTATCCATTGGACTGTGGTGGACTTTTGATAGggCTA 
GgAGcGGTTTTGGCCT TGGAGTAGGAATTGCCTTCTTGGCAACTGTGGT 
CACTCAACTGCTAGTATATAATGGTGTTTACCA-3’. SSODN sequence for 
INSIG2(S151D): 5’-AAAGTGGAT T TCGATAACAACATACAGTTGTCTCTCA 
CACTGGCTGCACTATCCATTGGACTGTGGTGGACTTTTGATAGggaTA 
GgAGcGGTTTTGGCCT TGGAGTAGGAATTGCCTTCTTGGCAACTGTGGT 
CACTCAACTGCTAGTATATAATGGTGTTTACCA-3’. sgRNA target- 
ing sequence for PCK1(S90A): 5’-GGCCAGGATCGAAAGCAAGA-3’; 
ssODN sequence for PCK1(S90A): 5’-CCGTGGTGCTTGGCTGA 
AAGGAAGCCTGTGA TTTTTGCAGCTGGTTGGCTCTCACTGACCC 
CAGGGATGTGGCCAGGATaGAggcCAAaACGGT TATCGTCACCCAAGA 
GCAAAGAGACACAGTGCCCATCCCCAAAACAGGCCTCAGCCAGCTCG 
GTCGCTGGATGTCAGAGG-3’. The lower-case letters in the ssODN 
sequences indicate the mutated nucleotides that will replace the 
endogenous nucleotides in the genomic DNA of parental cells 
using the CRISPR-Cas9 system. Genotyping was performed by 
sequencing PCR products amplified from the following prim- 
ers: INSIG1 forward: 5’-AGAATGGGGCTATCGATGACTTC-3’; INSI 
Gl reverse: 5’-TGTAGTGGGGATATGCAGAACG-3’; INSIG2 forward: 5’- TC 
AAGTTCCTGTACGATTCTCAAGT-3’; INSIG2 reverse: 5’-AGCAA 
ACAAGCACCAAAAATTG-3’; PCK1 forward: 5’-AAGGCCTTCG 
GGTAGTTTCAG-3’; PCK1 reverse: 5’-AGCCCCCTGGGT TAGAAGAG-3’. 


SRE luciferase assay 

SRE luciferase activity in cell lysates was measured with the luciferase 
assay system as previously described”. In brief, Huh7 cells in 24-well 
plates were transfected with 0.1 pg SRE-driven luciferase reporter and 
0.075 pg B-galactosidase. At 24 h after the transfection, the cells were 
subjected to different culture media. Cell lysates were measured for 
the activity of luciferase and B-galactosidase on the basis of the manu- 
facturer’s instruction (Promega). 


Immunofluorescence analysis 

Immunofluorescence analysis was performed as previously reported™. 
Cultured cells were fixed by 4% paraformaldehyde (PFA), treated with 
0.1% Triton X-100 for 5 min and blocked in3% BSA for 1h. The cells were 
then incubated with primary antibodies at a dilution of 1:100. For tis- 
sue staining, tumour masses from mice were perfused with 0.1 M PBS 
(pH 7.4), embedded into optimal cutting temperature compound and 
frozen for cryostat section. Cryostat sections were fixed with 4% PFA for 
15 min at room temperature. After PBS washing, cryostat sections were 
incubated in the blocking solution (PBS containing 3% donkey serum, 
1% BSA, 0.3% Triton X-100 at pH 7.4) for 30 min at room temperature. In 
antibody reaction buffer (PBS plus 1% BSA, 0.3% Triton X-100 at pH 7.4), 
samples were stained with primary antibodies against SREBP1 antibody 
overnight at 4 °C. After incubation with fluorescent-dye-conjugated 


secondary antibodies and DAPI, immunofluorescent microscopic 
images of the cells were obtained and viewed using an IX81 confocal 
microscope (Olympus America). Colocalization of proteins was quanti- 
fied by calculating Pearson’s correlation coefficient using the Coloc 2 
plugin in ImageJ (National Institutes of Health). 


Cell viability analysis 

Cells (2 x 10°) were plated in DMEM with 10% FBS (complete medium). 
After treatment with lipid-depleted medium for the indicated time, the 
viable cells were stained with trypan blue (0.5%) and counted using a 
Cell Viability Analyzer (Beckman Coulter). 


Quantitative PCR 

Total RNA was extracted from cells and tissue samples using TRI- 
zol reagent according to the manufacturer’s instructions (Inv- 
itrogen). Equal amounts of RNA samples were used for cDNA 
synthesis with a TaqMan Reverse Transcription Reagents kit 
(Applied Biosystems). Quantitative PCR analysis was carried out 
using a 7500 Real-Time PCR system (Applied Biosystems) with a 
SYBR Premix Ex Taq kit (Takara Bio). The following primers were 
used for quantitative PCR: FASN, 5’-CACAGGGACAACCTGGAGTT-3’ 
and 5’-ACTCCACAGGTGGGAACAAG-3’; SCD, 5’-CGACGTGGCTT 
TTTCTTCTC-3’ and 5’-CCTTCTCTTTGACAGCTGGG-3’; ACACA, 
5’-AGTGGGTCACCCCATTGTT-3’ and 5’- TTCTAACAGGAGCTGGAGCC-3’; 
GPAM, 5’-TTGTGGCTTGCCTGCTCCTCTA-3’ and 5’-AATCACGAG 
CCAGGACTTCCTC-3’; HMGCR, 5’-TCTGGCAGTCAGTGGGAACTATT-3’ 
and 5’-CCTCGTCCTTCGATCCAATTT-3’; HMGCS1, 5’-GATGTG 
GGAATTGTTGCCCTT-3’ and 5’-AT TGTCTCTGT TCCAACTTCCAG-3’; 
LDLR, 5’-AACGGTCATTCACCCAGGTC-3’ and 5’-GGCTGAAGA 
ATAGGAGTTGCC-3’; FDFT1, 5’-CGATAGCT GTGTGCAAAGTAACT-3’ 
and 5’-CCATCTGCTGAGTGCTTTCTG-3’; GAPDH, 5’-AGCCACATCGC 
TCAGACAC-3’ and 5’-GCCCAATACGACCAATCC-3’. 


TUNEL assay 

Mouse tumour tissues were sectioned at 5-m thickness. Apoptotic 
cells were counted using the DeadEnd Colorimetric TUNEL System 
(Promega) according to the manufacturer’s instructions. 


IHC analysis and histological evaluation of human HCC specimens 
Human HCC tissue collection and study approval were described previ- 
ously**, Human HCC and adjacent matched non-tumour tissue samples 
(EHBH cohort) were obtained from Eastern Hepatobiliary Surgery 
Hospital in Shanghai, China. The use of human HCC samples and the 
relevant database was approved by the Eastern Hepatobiliary Surgery 
Hospital Research Ethics Committee and complied with all relevant ethi- 
cal regulations. All tissue samples were collected in compliance withthe 
informed consent policy. Sections of paraffin-embedded human HCC 
samples were stained with antibodies against AKT(pS473), PCK1(pS90), 
INSIG1(pS207)/INSIG2(pS151), SREBP1 or non-specific lgG as a negative 
control. The staining of the tissue sections was quantitatively scored 
according to the percentage of positive cells and the staining inten- 
sity as described previously’. The following proportion scores were 
assigned to the sections: 0 if 0% of the tumour cells exhibited positive 
staining, 1 for O-1%, 2 for 2-10%, 3 for 11-30%, 4 for 31-70% and 5 for 
71-100%. In addition, the staining intensity was rated on ascale of 0-3: 
O, negative; 1, weak; 2, moderate; and 3, strong. The proportion and 
intensity scores were then combined to obtain a total score (range, 1-8) 
as described previously*’. Scores were compared with overall survival 
duration, defined as the time from date of diagnosis to that of death or 
last known follow-up examination. All patients had received standard 
therapies after surgery. 


Mouse studies 


One million Huh7 cells with or without gene editing in PCK1 and INSIG1/2 
were collected in 20 pp] DMEM with 33% matrigel and intrahepatically 


or subcutaneously injected into 6-week-old male BALB/c athymic nude 
mice. The injections were performed as described previously®. Seven 
mice per group in each experiment were used. Mice were euthanized 
28 days after injection. The liver of each mouse was dissected and 
then fixed in 4% formaldehyde and embedded in paraffin. The tumour 
volume was calculated using the formula: V=1/2a’b (V, volume; a, 
shortest diameter; b, longest diameter). The mice were treated in 
accordance with relevant institutional and national guidelines and 
regulations. 

The transgene HCC mouse model was established by overexpression 
of activated AKT combined with c-Met using hydrodynamic transfec- 
tion”. Wild-type FVB/N mice were subjected to hydrodynamic injection 
as previously described”. In brief, the plasmids pT3-EFla- Flag-PCK1 
(wild-type or S90A-mutant) (20 pg), pT3-EFla-HA-myr-AKT (20 pg) 
and pT3-EFla-V5-c-Met (20 pg) together with pCMV/sleeping beauty 
transposase (SB) (2.4 1g), ina ratio of 12.5: 12.5: 12.5: 1.5, were diluted in 
2 ml saline (0.9% NaCl), filtered through a 0.22-um filter and injected 
into the lateral tail vein of the male mice in 6s. After 14 weeks, the mice 
were euthanized and liver tumours were extracted. 

The use of the mice was approved by the Institutional Review Board 
at MD Anderson Cancer Center and the Institutional Animal Care and 
Use Committee (IACUC) of Zhejiang University and Qingdao Cancer 
Institute, China. The maximum tumour diameter permitted by the 
committees is 1.5 cm. Mice arriving in the animal facility were randomly 
put into cages with five mice each. No statistical methods were used to 
predetermine sample size. 


Statistics and reproducibility 

All statistical data are presented as means +s.d. All experiments were 
repeated at least twice independently with similar results. The mean 
values obtained inthe control and experimental groups were analysed 
for significant differences. Pairwise comparisons were performed using 
atwo-tailed t test. P values of less than 0.05 were considered significant. 
Unless stated otherwise, the experiments were not randomized and 
investigators were not blinded to allocation during experiments and 
outcome assessment. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Allsource data for immunoblotting are shown in Supplementary Fig. 1. 
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Extended Data Fig. 1| IGF1-induced AKT activation induces the translocation 
of PCK1 to the ER and the binding of PCK1 to INSIG1/2. a, Huh7 cells with or 
without expression of Flag-INSIGI (left) or Flag—INSIG2 (right) were treated with 
or without IGF1 (100 ng mI) for 1h. Animmunoprecipitation assay was 
performed using anti-Flag antibody, and immunoprecipitates of Flag—INSIG1 or 
Flag-INSIG2 were eluted with Flag peptide, separated using SDS-PAGE and 
stained with Coomassie Brilliant Blue. Selected peptide hits of proteins associated 
with Flag-INSIGI1 or Flag-INSIG2, identified through mass spectrometry, are 
shown. b, Hep3B cells were treated with or without IGF1 (100 ng mI’) for 1h. 

c, Huh7 cells expressing Flag-PCK1 or Flag-PCK2 were stimulated with or without 
IGF1 (100 ng mI’) for 1h. d, Huh7 cells expressing Flag-PCK1 or Flag-PCK2 were 
stimulated with or without IGF1 (100 ng mI”) for1h. Immunofluorescence 
analyses were performed with the indicated antibodies (top). The colocalization 
coefficients between the indicated proteins in the presence or absence of IGFlare 
shown (bottom). At least n=50 cells from each independent experiment were 
analysed and representative data are shown. Data are mean +s.d. **P< 0.001 
(two-tailed t-test). The regions in white boxes are shown at higher magnification 
onthe right. e, Whole cell lysate (WCL), cytosolic and ER fractions were prepared 
from Huh7 and Hep3B cells stimulated with or without IGF1 (100 ng mI) for 1h. 
Cellular fractions from equal numbers of cells were analysed using 
immunoblotting with the indicated antibodies. f, Huh7 cells expressing Flag- 
PCK1 or Flag-PCK2 were stimulated with or without IGF1 (100 ng mI’) for1h. ER 
fractions and total lysates were prepared for immunoblotting analyses with the 
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indicated antibodies. g, Total lysates were prepared from Huh7 cells pretreated 
with or without U0126 (20 uM), SP600125 (25 LM), SU6656 (4 UM) or MK-2206 

(10 uM) for 30 min before treatment with or without IGF1 (100 ng mI’) for 1h. 

h, Huh7 cells expressing wild-type HA-AKT1 or HA-AKT1-DN (K179A, T308A, 
$473A were treated with or without IGF1 (100 ng mI) for 1h. i, Huh7 cells were 
pretreated with or without U0126 (20 tM), SU6656 (4 uM), SP600125 (25 1M) or 
MK-2206 (10 uM) for 30 min before treatment with or without IGF1 (100 ng mI’) 
for 1h. ER fractions and total lysates were isolated for immunoblotting analyses 
with the indicated antibodies. j, Huh7 cells were stably transfected with a control 
vector or a vector expressing HA-AKT1-DN (K179A, T308A, S473A). The cells were 
treated with or without IGF1 (100 ng mI) for 1h. ER fractions and total lysates 
were isolated for immunoblotting analyses with the indicated antibodies. k, Huh7 
cells were pretreated with or without MK-2206 (10 1M) for 30 min and treated with 
or without IGF1 (100 ng mI) for 1h. Immunofluorescence analyses were 
performed with the indicated antibodies (left). The colocalization coefficients 
between the indicated proteins are shown (right). At least n = 50 cells from each 
independent experiment were analysed and representative data are shown. Data 
are mean+s.d.**P< 0.001 (two-tailed t-test). 1, Huh7 cells expressing Flag—PCK1 or 
Flag-PCK2 were transfected with or without HA-myr-AKT1. m, Huh7 and Hep3B 
cells were treated with or without IGF1 (100 ng mI’) for 1h. Inb,c, e-j, I, m, 
immunoprecipitation and immunoblotting analyses were performed with the 
indicated antibodies, and the experiments were repeated three times 
independently with similar results. 
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Extended Data Fig. 2 | See next page for caption. 
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Extended Data Fig. 2| AKT-mediated phosphorylation of PCK1 Ser90 is 
necessary and sufficient for the translocation of PCK1 to the ER and the 
binding of PCK1 to INSIG1/2, and inhibits the canonical role of PCK1in 
gluconeogenesis. a, A GST pull-down assay was performed by mixing purified 
His-PCK1 with purified GST or GST-AKT1. Immunoblotting analyses were 
performed as indicated. b, Left, schematic of AKT1 full-length and deletion 
mutants. Right, 293T cells were transfected with the indicated constructs, and 
GST pull-down assays and immunoblotting analyses with the indicated 
antibodies were performed. c, In vitro kinase assays were performed by mixing 
purified GST-PCK1 with purified active or inactive His-AKT1 in the presence of 
[y->P]ATP. Autoradiography and immunoblotting analyses were performed as 
indicated. d, In vitro kinase assays were performed by mixing purified His-PCK1 
with or without purified GST-AKT1in the presence of ATP. Mass spectrometric 
analysis of a tryptic fragment at m/z 805.91107 Da (-0.06 mmu/-0.08 ppm), 
which was matched with the +2 charged peptide 88-IESKTVIVTQEQR-100, 
suggested that PGK1Ser90 was phosphorylated. The Mascot score was 31, and 
the expectation value was 0.25. e, Alignment of protein sequences spanning PCK1 
Ser90 from different species. f, Alignment of PCK1 Ser90 to the non-canonical 
AKT-phosphorylated substrate motif (RXXS/T). The reported AKT substrates 
(TFEB, YAP1, CREB, CDK2, B-catenin, FAM129A, CDC25B and NPM) are shown. 
Red, phosphoacceptor residue; blue, basic residue (R). g, In vitro kinase assays 
were performed by mixing purified GST-PCK1 or GST-PCK1(S90A) and active 
His-AKT1 in the presence or absence of MK-2206 (10 1M) for 1h. Immunoblotting 
analyses were performed as indicated. h, Left, IHC analyses of human HCC 
samples were performed with the indicated antibodies in the presence or 
absence of a blocking peptide for PCK1(pS90). Right, Huh7 cells expressing 
Flag-PCK1 were treated with or without IGF1 (100 ng mI) for 1h. 
Immunoprecipitation and immunoblotting analyses were performed with the 
indicated antibodies in the presence or absence of a blocking peptide for 
PCK1(pS90). i, Huh7 cells expressing wild-type Flag-rPCK1 or Flag-rPCK1(S90A) 
were pretreated with or without MK-2206 (10 tM) for 30 min. The cells were 
treated with or without IGF1 (100 ng mI“) for 1h. Immunoprecipitation and 
immunoblotting analyses were performed with the indicated antibodies. 
Jj, PCK1-depleted Huh7 cells were reconstituted with the indicated 
shRNA-resistant PCK1 proteins. After transfection with HA-myr-AKT1, ER 
fractions and total cell lysate were prepared for immunoblotting analyses as 
indicated. k, Huh7 and Hep3B cells expressing the indicated Flag-rPCK1 proteins 
were treated with or without IGF1 (100 ng mI“) for 1h. ER fractions and total cell 
lysate were prepared for immunoblotting analyses as indicated. I, m, Genomic 
DNA was extracted from two individual clones of parental Huh7 or Hep3B cells 
with knock-in expression of PCK1(S90A). PCR products were amplified from the 


indicated DNA fragment (I) and separated on an agarose gel (m). n, Sequencing 
of parental Huh7 and Hep3B cells and two individual clones of parental cells with 
knock-in expression of PCK1(S90A). The red line indicates the sgRNA-targeting 
sequence. The black line indicates the protospacer adjacent motif (PAM). Blue 
arrows indicate mutated nucleotides. A mutated amino acid and its wild-type 
counterpart are indicated by the solid red box. o, p, Parental Huh7 and Hep3B 
cells and the indicated clones of Huh7 and Hep3B cells with knock-in expression 
of PCK1(S90A) were stimulated with or without IGF1 (100 ng mI’) for1h. ER 
fractions and total cell lysates were prepared and immunoblotting analyses were 
performed as indicated (0). Immunofluorescence staining of Huh7 cells was 
performed with the indicated antibodies (p, left). The colocalization coefficients 
between the indicated proteins are shown (p, right). At least n = 50 cells from 
each independent experiment were analysed and representative data are shown. 
Data are mean +s.d. **P< 0.001 (two-tailed t-test). q-s, Bacterially purified 
wild-type His-PCK1 or His-PCK1(S90A) on Ni-NTA agarose beads were incubated 
with or without purified active GST-AKT1 in the presence of ATP for an in vitro 
AKT kinase assay. Immunoblotting analyses were performed as indicated (q). 
After washing wild-type His-PCK1 or His-PCK1(S90A)-conjugated beads with 
PBS five times, the binding affinity of PCK1 to oxaloacetate (OAA) (r) and the 
relative PCK1 activity (s) were measured. Data are mean +s.d. (n= 6 biological 
replicates). **P< 0.001 (two-tailed t-test); NS, not significant (P= 0.923 (r); 
P=0.728 (s)).t-v, Wild-type His-PCK1 and His—PCK1(S90E) were purified from 
bacteria and Coomassie Brilliant Blue staining analyses were performed (t). 

The binding affinity of the His-PCK1 proteins to oxaloacetate (u) and the relative 
PCK1activity (v) were measured. Data are mean +s.d. (n=6 biological replicates). 
**P< (0.001 (two-tailed t-test). w, Bacterially purified wild-type GST-PCK1 or 
GST-PCK1(S90A) on glutathione agarose beads were incubated with or without 
active His-AKT1in the presence of ATP for an in vitro kinase assay. The 
GST-tagged proteins were then incubated with or without CIP (10 U) for 30 min at 
37 °C, followed by incubation with Flag/His-tagged INSIGI or INSIG2 purified 
from Huh7 cells for a pull-down assay. x, SFB-tagged wild-type PCK1 or 
PCK1(S90A) were pulled down from Huh7 cells treated with or without IGF1 

(100 ng mI) for 1h. These proteins were incubated with or without CIP (10 U) for 
30 min at 37 °C. Immunoblotting analyses were performed as indicated. 

y, Parental Hep3B cells and the indicated clones of cells with knock-in expression 
of PCK1(S90A) were stimulated with or without IGF1 for 1h. Immunoprecipitation 
and immunoblotting analyses were performed as indicated. z, Huh7 cells 
expressing /NS/G1 shRNA and INS/G2 shRNA were treated with or without IGF1 
(100 ng mI) for 1h (left) or transfected with HA-myr-AKT]1 (right). ER fractions 
and total cell lysate were prepared for immunoblotting analyses as indicated. All 
experiments were repeated three times independently with similar results. 
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Extended Data Fig. 3 | See next page for caption. 
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Extended Data Fig. 3 | INSIG1/2 is required for the translocation of PCK1to 
the ERand PCK1 functions as a protein kinase to phosphorylate INSIG1 
Ser207 and INSIG2 Ser151. a, Top, the topological structures of INSIG1 and 
INSIG2, which have 69% amino acid sequence identity and contain six 
transmembrane-spanning regions. Bottom, different INSIGI (left) or INSIG2 
(right) truncation mutants were expressed in 293T cells. These cells were 
treated with or without IGF1 (100 ng mI) for1h. Immunoprecipitation and 
immunoblotting analyses were performed with the indicated antibodies. 

b, Bacterially purified GST-PCK1 was incubated with or without active His-AKT1 
inthe presence of ATP for Lh. After the in vitro kinase assay, GST- 
PCK1-conjugated beads were washed with PBS five times and then incubated 
with purified Flag/His-INSIG1 or Flag/His—-INSIG2 in the absence or presence of 
aPCK1(pS90) peptide for 2 h. Immunoblotting analyses were performed with 
the indicated antibodies. c, Endogenous PCK1-depleted Huh7 cells with 
reconstituted expression of shRNA-resistant wild-type rPCK1 or rPCK1(C228S) 
were transfected with Flag-INSIGI (left) or Flag-INSIG2 (right), respectively. 
After being treated with or without IGF1 (100 ng mI’) for 1h, the cells were 
collected for immunoprecipitation and immunoblotting analyses as indicated. 
d, Bacterially purified wild-type His-PCK1, His-PCK1(S90A) or His- 
PCK1(C288S) on Ni-NTA agarose beads were incubated with or without active 
GST-AKT1in the presence of ATP for anin vitro AKT kinase assay. The beads 
were washed with PBS five times and incubated with SFB-INSIG2 in the presence 
of [y-?P]GTP. Autoradiography and immunoblotting analyses with the indicated 
antibodies were performed. e, In vitro kinase assays were performed by mixing 
purified His-PCK1o0n Ni-NTA agarose beads with purified active GST-AKT1in 
the presence of ATP for 1h. His-PCK1-conjugated Ni-NTA agarose beads were 
washed with PBS five times and then incubated with SFB-INSIG1 or SFB-INSIG2 
purified from Huh7 cells inthe presence of [y-*P]ATP or [y-”P]GTP for Lh. 
Autoradiography and immunoblotting analyses with the indicated antibodies 
were performed. f, Anin vitro kinase assay was performed as ine, except that the 
His-PCK1-conjugated beads were incubated with SFB-INSIGI1 purified from 
Huh7 cells inthe presence of GTP for 1h. Mass spectrometric analysis of atryptic 
fragment at m/z 764.40387 Da (-1.96 mmu/-2.57 ppm), which was matched with 
the +2 charged peptide 205-RSRSGLGLGITIAF-218, suggested that INSIG1 
Ser207 was phosphorylated. The Mascot score was 53, and the expectation 


value was 0.021. g, PCK1-mediated phosphorylation residues inthe cytosolic 
loop 2 of both INSIG1and INSIG2. h, Alignment of protein sequences spanning 
INSIG1 Ser207 and INSIG2 Ser151 from different species. i, Bacterially purified 
wild-type His-PCK10n Ni-NTA agarose beads was incubated with or without 
purified active GST-AKT1in the presence of ATP for 1h for anin vitro AKT kinase 
assay. The beads were then washed with PBS five times and incubated with or 
without wild-type SFB-INSIG2 or SFB-INSIG2(S151A) in the presence of [y-P] 
GTP. Autoradiography and immunoblotting analyses with the indicated 
antibodies were performed.j, IHC analyses of human HCC samples were 
performed with the indicated antibodies in the presence or absence ofa 
blocking peptide for INSIG1(pS207) and INSIG2(pS151). k, Huh7 cells expressing 
Flag-INSIG1 (top) or Flag-INSIG2 (bottom) were treated with or without IGF1 
(100 ng mI’) for1h. Immunoprecipitation and immunoblotting analyses were 
performed with the indicated antibodies in the presence or absence ofa 
blocking peptide for INSIG1(pS207) and INSIG2(pS151). I, Enzyme kinetics plots 
of velocity relative to GTP concentration between purified wild-type His-PCK1 
and His-PCK1(S90E). The V,,,, and K,, of PCK1in phosphorylating an INSIG1 
peptide at Ser207 were calculated (n= 6). Data are mean +s.d. m,n, Bacterially 
purified wild-type His-PCK1, His-PCK1(S90E) or His-PCK1(S90D) on Ni-NTA 
agarose beads were incubated with wild-type SFB-INSIG1 or SFB-INSIG1(S207A) 
(m) or wild-type SFB-INSIG2 or SFB-INSIG2(S151A) (n) in the presence of [y-”P] 
GTP. Autoradiography and immunoblotting assays with the indicated 
antibodies were performed. o, Wild-type Flag-INSIG1 or Flag-INSIG1(S207A) 
(left) or wild-type Flag-INSIG2 or Flag-INSIG2(S151A) (right) were transfected 
into Huh7 cells. Huh7 cells were stimulated with or without IGF1 (100 ng mI’) for 
1h. Immunoprecipitation and immunoblotting analyses were performed with 
the indicated antibodies. p, Wild-type Flag—-INSIG1 or Flag-INSIG1(S207A) (left) 
or wild-type Flag-INSIG2 or Flag-INSIG2(S151A) (right) were co-transfected 
with or without HA-myr-AKT1into Huh7 cells. Immunoprecipitation and 
immunoblotting analyses were performed with the indicated antibodies. 

q, Parental Huh7 cells and the indicated clones of cells with knock-in expression 
of PCK1(S90A) were transfected with SFB-INSIGI (left) or SFB-INSIG2 (right). 
These cells were then stimulated with or without IGF1 (100 ng mI*) for 1h. All 
experiments were repeated three times independently with similar results. 
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Extended Data Fig. 4 | Generation of Huh7 and Hep3B cells with knock-in 
expression of both INSIG1(S207A) and INSIG2(S151A), and PCK1-mediated 
phosphorylation of INSIG1/2 reduces the binding of oxysterols to INSIG1/2. 

a, b, Genomic DNA was extracted from two individual clones of parental Huh7 and 
Hep3B cells with knock-in expression of INSIG1(S207A) (a) and INSIG2(S151A) (b). 
PCR products amplified from the indicated DNA fragments were separated on an 
agarose gel. c, d, Sequencing of parental and two individual clones of parental 
Huh7 and Hep3B cells with knock-in expression of INSIG1(S207A)/INSIG2(S151A) 
double mutants. The red line indicates the sgRNA-targeting sequence. The black 
line indicates the PAM. Blue arrows indicate mutated nucleotides. A mutated 
amino acid and its wild-type counterpart are indicated by the solid red box. e, Top, 
Flag/His-tagged INSIG2 immunoprecipitated and purified from Huh7 cells was 
incubated with the indicated GST-PCK1 proteins with or without active GST-AKT1 
in the presence of ATP and GTP for 1h. Immunoblotting analyses were performed 
with the indicated antibodies. Bottom, INSIG2-conjugated Ni-NTA agarose beads 
were washed and incubated with 400 nM [?H]25-hydroxycholesterol. Specifically 
bound [?H]25-hydroxycholesterol was measured (n= 6). **P< 0.001 (two-tailed 
t-test); NS, not significant (P=0.875, 0.846, 0.969, 0.924 (left to right)). f, g, Top, 
Flag/His-tagged wild-type INSIG1 or INSIG1(S207A) (f) or Flag/His-tagged 
wild-type INSIG2 or INSIG2(S151A) (g) purified from Huh7 cells were incubated 
with purified wild-type GST-PCK1, GST-PCK1(S90D) or GST-PCK1(S90E) in the 
presence of GTP for 1h. Immunoblotting analyses were performed with the 
indicated antibodies. Bottom, the INSIGI or INSIG2 proteins on Ni-NTA agarose 
beads were washed with PBS five times and incubated with 400 nM 
[(?H]25-hydroxycholesterol. Specifically bound [?H]25-hydroxycholesterol was 
measured (n=6).**P< 0.001 (two-tailed t-test); NS, not significant (P=0.823, 
0.445, 0.185 (left to right) (f); P=0.320, 0.196, 0.735 (left to right) (g)). h, Top, 


Flag/His-tagged wild-type INSIG2 or INSIG2(S151A) purified from Huh7 cells were 
incubated with purified wild-type GST-PCK1 with or without purified active 
GST-AKT1in the presence of ATP and GTP for Lh. Immunoblotting analyses were 
performed with the indicated antibodies. Bottom, the INSIG2 protein on Ni-NTA 
agarose beads was washed with PBS five times and incubated with 400 nM 
[3H]25-hydroxycholesterol. Specifically bound [?H]25-hydroxycholesterol was 
measured (n= 6). **P< 0.001 (two-tailed t-test); NS, not significant (P= 0.682, 
0.947 (left to right)). i,j, Flag/His-tagged wild-type INSIG1 or INSIG1(S207A) (i) or 
wild-type INSIG2 or INSIG2(S151A) (j) were purified from Huh7 treated with or 
without IGF1 (100 ng mI’) for 12 h. Immunoblotting analyses were performed with 
the indicated antibodies (left). The INSIG1 or INSIG2 proteins on Ni-NTA agarose 
beads were incubated with the indicated concentration of 
[(?H]25-hydroxycholesterol. Specifically bound [?H]25-hydroxycholesterol was 
measured (n=6) (right). k, I, Top, Flag/His-tagged INSIGI (k) or INSIG2 (1) was 
expressed in parental Huh7 cells and the Huh7 cells with knock-in expression of 
PCK1(S90A). After these cells were treated with or without IGF1 (100 ng mI’) for 
12h, immunoblotting analyses were performed with the indicated antibodies. 
Bottom, the immunoprecipitated and purified INSIG1 or INSIG2 was incubated 
with the indicated concentration of [7H]25-hydroxycholesterol. Specifically 
bound [?H]25-hydroxycholesterol was measured (n=6).m, n, Top, the 
immunoprecipitated and purified Flag/His-tagged wild-type INSIG1 or 
INSIG1(S207E) (m) or Flag/His-tagged wild-type INSIG2 or INSIG2(S151E) 

(n) from Huh7 cells was incubated with the indicated concentration of 
[?H]25-hydroxycholesterol. Specifically bound [?H]25-hydroxycholesterol was 
measured (n=6).**P< 0.001 (two-tailed t-test). Data in e-n are mean +s.d. All 
experiments were repeated three times independently with similar results. 
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Extended Data Fig. 5| PCK1-mediated phosphorylation of INSIG1/2 
promotes the translocation of SCAP from the ER to the Golgi apparatus. 

a, Parental Hep3B cells and the indicated clones of Hep3B cells with knock-in 
expression of INSIG1(S207A)/INSIG2(S151A) double mutants were transfected 
with Myc-SCAP and stimulated with or without IGF1 (100 ng mI) for 16h. 
Immunoprecipitation and immunoblotting analyses with the indicated 
antibodies were performed. b, Validation of the specificity of the SCAP 
antibody. SCAP shRNA was expressed in 293T cells. Immunoblotting analyses 
were performed with the indicated antibodies. c, Parental Huh7 or Hep3B cells 
and the indicated clones with knock-in expression of PCK1(S90A) were 
transfected with the indicated plasmids and stimulated with or without IGF1 
(100 ng mI?) for 16 h. Ni-NTA pull-down assays, immunoprecipitation and 
immunoblotting analyses with the indicated antibodies were performed. 

d, PCK1-depleted Huh7 (left) and Hep3B (right) cells were stably transfected 
with shRNA-resistant wild-type rPCK1 or rPCK1(S90A). Immunoblotting 
analyses were performed with the indicated antibodies. e, f, PCK1-depleted 
Huh7 (left) and Hep3B (right) cells with reconstituted expression of 
shRNA-resistant wild-type rPCK1and rPCK1(S90A) (e) or rPCK1(C288S) (f) were 
transfected with vectors expressing Flag-INSIG1 and His-INSIG2. The cells 
were then stimulated with or without IGF1 (100 ng mI*) for 16h. Ni-NTA 
pull-down assays, immunoprecipitation and immunoblotting analyses with the 
indicated antibodies were performed. g, h, Parental Huh7 cells and the 
indicated clones of Huh7 cells with knock-in expression of INSIG1(S207A)/ 
INSIG2(S151A) were stimulated with or without IGF1 (100 ng mI‘) for 16 h. The 
cells were then subjected to homogenization and cell fractionation using 
gradient centrifugation. Immunoblotting analyses were performed with the 
indicated antibodies (g). The relative distribution of each protein in different 
fractions was quantified by densitometric analysis of the blots (n=3) (h). 

i,j, Parental Huh7 cells and the indicated clones of Huh7 cells with knock-in 
expression of PCK1(S90A) were stimulated with or without IGF1 (100 ng mI?) 
for 16h. The cells were then subjected to homogenization and cell fractionation 
using gradient centrifugation. Immunoblotting analyses were performed with 
the indicated antibodies (i). The relative distribution of each proteinin 
different fractions was quantified by densitometric analysis of the blots (n=3) 
(j). k, Colocalization coefficients between SCAP and calnexin in the indicated 
cells. At least n=50 cells from each independent experiment were analysed and 
representative data are shown. **P< 0.001 (two-tailed t-test). 1, Parental Huh7 


cells and the indicated clones of Huh7 cells with knock-in expression of 
INSIG1(S207A)/INSIG2(S151A) were stimulated with or without IGF1 for 16h. 
Immunofluorescence analyses were performed with the indicated antibodies. 
The white arrows indicate the Golgi-localized SCAP. Scale bar, 20 pm. 

m, Colocalization coefficients between SCAP and golgin-97. At least n=50 cells 
from each independent experiment were analysed and representative data are 
shown. **P<0.001 (two-tailed ¢-test).n, 0, Top, parental Huh7 cells and the 
indicated clones of Huh7 cells with knock-in expression of PCK1(S90A) were 
stimulated with or without IGF1 (100 ng mI”) for 16 h. Immunofluorescence 
analyses were performed with the indicated antibodies. The white arrows 
indicate the Golgi-localized SCAP. Bottom, colocalization coefficients 
between SCAP and calnexin (n) or golgin-97 (0). At least n=50 cells from each 
independent experiment were analysed and representative data are shown. 
**P< 0.001 (two-tailed t-test). Scale bars, 20 pm. p,q, Top, parental Huh7 cells 
and the indicated clones of Huh7 cells with knock-in expression of 
INSIG1(S207A)/INSIG2(S151A) double mutants were stably transfected with 
Myc-SCAP and stimulated with or without IGF1 (100 ng mI") for 16h. 
Immunofluorescence analyses were performed with the indicated antibodies. 
The white arrows indicate the Golgi-localized SCAP. Bottom, colocalization 
coefficients between Myc-SCAP and calnexin (p) or golgin-97 (q). Atleast 
n=50 cells from each independent experiment were analysed and 
representative data are shown. **P< 0.001 (two-tailed t-test). Scale bars, 20 um. 
r, Parental Huh7 and Hep3B cells and the indicated clones of these cells with 
knock-in expression of INSIG1(S207A)/INSIG2(S151A) were stimulated with or 
without IGF1 (100 ng mI’) for 16h. N, Nterminus of SREBP1;P, precursor of 
SREBP1.s, Parental Huh7 cells and the indicated clones of Huh7 cells with 
knock-in expression of INSIG1(S207A)/INSIG2(S151A) were transiently 
transfected with vectors expressing B-galactosidase and an SRE-driven 
luciferase reporter. Twenty-four hours later, the cells were stimulated with or 
without IGF1 (100 ng mI) for 16h. The relative SRE luciferase activity after 
normalization to B-galactosidase activity is shown (n=6).**P< 0.001 
(two-tailed t-test). t, Parental Huh7 and Hep3B cells and the indicated clones of 
these cells with knock-in expression of INSIG1(S207A)/INSIG2(S151A) were 
stimulated with or without IGF1 (100 ng mI’) for 16 h. Immunoblotting analyses 
were performed with the indicated antibodies. Datainh,j,k, m-q,s are 

mean +s.d. All experiments were repeated three times independently with 
similar results. 


a Huh7 Hep3B b c d »-IGF49+IGF1 
ng or ooh eS ge” SREBP1 + Sars ee CA SPAM 
PCKL AT _c@OP"_cQOP_ VK QPP" i WSS = 
IGFi- +> +> + - +7 tS B84 wee : 5 
ee X @ G 
- - P aS 233 223 3 3 
- 3 @2 222 2 2 
- = —N - est ie —£1 1 1 
WB: SREBP1 -5 20 ns 0 9 0 
=== = = «= Sq PCK1 nw x PORT YG, ON oh PCKT WS oN oh PCR EXO PORIGR, AS 
WB: AKT pS473 as rel eos) eorhaoh eorhzooh cor eooh 
<8 g IGF1+1GF1 2 “IGFA+IGF1 i =e a 
< Se £23, = ul 
[s) S 
e WB: Tubulin a5 58 528 Nc 
Huh7 4 Hep3B 4 6 § gsz? 
ae or oooh aah! f sreBr1 “'GFI+IGF1 cpepey O85 2851 
PCKL NS 5 x xe eS ee gs 2se 
IGF SSE 34) = 34 = B29 BFE a 
NE ee eee 2 $3 283 @ PCKIAS B POKI Gch ot == [-N 
WB: FAS 22 22> s ategoh™ a WB: SREBP1 
i eZ OZ h ol —— = 
WB: SCD1 oe i ae - IGF 1=+1GF1 IGF 1-+1GF1 . RICE IGr Q |B: INSIG1/2 pS207/S151 
i i jon-small cell lung = te 
- —-— = _-_ —- = Apa 0 nN q@ Melanoma cell line Glioblastoma cell line g A | a 
WB: AKT pS473 INSIG1/2 § laa PCK1 Wk 8 CHL-1 8 U87 s cancer cellline WB: INSIG1 
ee eee" ama | s90A S90A 2S/A 2S/A 3883 3833) ass S| ——_ << 
WB: AKT PCK1.WT -C1_ -C2 INSIG1/2WT_-C1_ _-C2 Oof> OSE o8> D |we: INSIG1/2 p$207/S151 
ween e eee ee (GEIS eee ae (GE = "oe =o ae = gh2 # 882 2 3e z asthe 
WB: Tubulin = 285, 285, 8s == == 
gee BEE ae ia WB: INS| 
; fe go @ @ o-- 
5 WB: SREEre = WB: SREBP2 HA-PCK1 g@oSP HATPCK1 Qo HATPCK1 oe op WB: Tubulin 
- 7] = 
3 B m 
Q\we: INSIG1/2 pS207/S151 2 |we: INSIG1/2 pS207/S151 »-IGF1=+I1GF1 «-IGF1=+IGF1 <-IGF1=+1GF1 »-IGF1:+1GF1 
| === = z HMGCR HMGCS1 LDLR FDFT1 
= WB: INSIG1 = WB: INSIG1 —)) = <8 3° Es 
z ) ae 3| I pecelta £36 £38 £36 
WB: SREBP1 ®|WB: INSIG1/2 pS207/S151_ | WB: INSIG1/2 pS207/S151 B<4 B <4 B<4 
meece 2 Zz OZ OZ OZ 
WB: Tubulin a| See | 2 © G2 & 2 
: = WB: INSIG2 7 WB: INSIG2 0 0 0 
Hep3B PCK1 SS PCKI gS PCK1 gS 
# se” WB: PCK1 pS90 
INSIG12 ss eueeeee eters “IGF 1=+IGF4 IGF 10+1GF1 oAGF1:+1GF1 
Ez Pe WB: PCK1 HMGCR HMGCS1 LDLR FDFT1 
2 a -_——_— = 38 eS 38 ae 38 38 = 
_—=—-N s . fs : = P 
WB: SREBP1 wes HSI £36 £36 $56 $36 
— ; 33' ast es es! 
WB: Tubulin WB” AKT WB: AKT ee2 é &z2 2&2 & £2 
Cede dal EO EO EF i 
WB: Tubulin WB: Tubulin insio1/2° aw oN (Co INSIG1/2 we CS ch INSIG1/2 Gp GN Ch INSIGIZ” QS A oh 
fe) INSIG1/2 WT INSIG1/2 2S/A-C1 __ INSIG4/2 2S/A-C2 alta ale al ale at 2 
Top(Light) Bottom( (Heavy) Top(Light) Bottom(Heavy) Top(Light) Bottom(Heavy) P jo 
5 8 -2S/A- -2S/A- 
Fraction? 234 5 6 7 8910111213 1234567 891011213 1234567 6891011213 £215, NSIGI2WT US KOe tas! INBICHS-28IN-Ge 
A ee --e@eee- - -- -----eeee- ----- +--+ ee8ee- SIE $ 
5 @ 9 r 3\2 1 ~Calnexin 
3 WB: SCAP WB: SCAP WB: SCAP =|2 
2 --eece- -- seen ew --s2ee- Fe 90.5 ~Golgin97 
€ WB: Calnexin WB: Calnexin WB: Calnexin 5/8 5 ~SCAP 
6| -eeeen— Se ee -eecece-.--- oO 
ao 
z WB: Golgin97 WB: Golgin97 Merona? : 12345678 9101123 12345678 910123 12345678 910111213 
i... -eoee----—... s|gts 
oO 5 * S/o 
a ees WE SCAR oe ee Sie ; ‘| Calnexin 
= meets < ~Golgin97 
a WB: Cainexin WB: Calnexin WB: Calnexin_ ge \ ocean 
S| -eeewe-- —-eeecco—— ee qj as 0 - io 
WB: Golgin97 WB: Golgin97 WB: Golgin97 ie 30 -123456789101121312345678 910123 1234567 8 910111213 
q PCK1 WT PCK1 S90A-C1 PCK1 S90A-C2 7 Fraction Number Fraction Number Fraction Number 
Top(Light) Bottom(Heavy) Top(Light) Bottom(Heavy) Top(Light) Bottom(Heavy) r e ey 5 
Fraction’ 2345 67 6 910111213 12345 67 691011213 1234567 6891011213 s/s" PER WT eae roe 
2... eee ee em@een |... wo eo meee Sie 4 ; Calnexin 
3 WB: SCAP WB: SCAP WB: SCAP Bloo5 ~Golging7 
2 _ - ceece— ---see0ce- Seeees =f +SCAP 
— WB: Calnexin WB: Calnexin WB: Cainexin Z\/o 0 
S| owe -eeeee- -2ecen-- 12345678 910IN23 123456789103 12345678 910111213 
WB: Golgin97 WB: Golgin97 WB: Golgin97 el. 5 
§| ewww en mmmmwen---- ~wwwww-- —-— 3 24 poe Snein 
3 WB: SCAP WB: SCAP WB: SCAP als bs nee 
3 weaneue _=<eese— .-seeece- 8) S05 Golgin97 
no] WB: Calnexin WB: Calnexin WB: Calnexin 3 g 0 2 “= SCAR! 
2 ns ~eemmewn- ---. Te Alo 
a = : © 123456789101112I3 1234567891023 12345678 910111213 
WB: Golging7 WB: Golgin97 WB: Golgin97 Fraction Number Fraction Number Fraction Number 


s Normal Medium Lipid Depletion 


Normal Medium 


Lipid Depletion 


SCAP_ Calnexin _ Merge SCAP_Calnexin Merge 


INSIG 1/2 
WT 


INSIG1/2 
2S/A-C1 


INSIG1/2) 
2S/A-C2 


PCK1 
S90A-C1 


PCK1 
S90A-C2) 


SCAP _ Calnexin 


fs 


SCAP _ Calnexin 


Merge 


i= 


Correlation Coefficient 


Lipid Depletion 


Vv Normal Medium Lipid Depletion Normal Medium 
SCAP_ Golgin97 Merge Sa Golgin97 _ Merge SCAP_Golgin97 _ Merge 
INSIG1/2 he 
WT ee 
& 
INSIG1/2 » PCK1 
2S/A-C1 re S90A-C1) 
INSIG1/2 PCK1 
2S/A-C2 S90A-C2} 


Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | PCK1-mediated INSIG1/2 phosphorylationis 
required for IGF1- induced SREBP activation for lipogenesis and does not 
affect the lipid depletion-induced translocation of SCAP from the ER tothe 
Golgi apparatus. a, Parental Huh7 (left) and Hep3B (right) cells and the 
indicated clones with knock-in expression of PCK1(S90A) were stimulated with 
or without IGF1 (100 ng mI’) for 16 h. Immunoblotting analyses were 
performed as indicated. b, Parental Huh7 cells and the indicated clones of Huh7 
cells with knock-in expression of PCK1(S90A) were stimulated with or without 
IGF1 (100 ng mI) for 16 h. Immunofluorescence analyses were performed as 
indicated. Scale bar, 20 pm. c, Parental Huh7 cells and the indicated clones with 
knock-in expression of PCK1(S90A) were transiently transfected with vectors 
expressing B-galactosidase and an SRE-driven luciferase reporter and 
stimulated with or without IGF1 (100 ng mI’) for 16 h. The relative SRE 
luciferase activity after normalization to B-galactosidase activity is shown 
(n=6). Dataare mean+ts.d.**P< 0.001 (two-tailed t-test). d, Parental Huh7 cells 
and the indicated clones with knock-in expression of PCK1(S90A) were 
stimulated with or without IGF1 (100 ng mI) for 16 h. The mRNA expression 
levels for SREBP target genes were measured using quantitative PCR (n=6). 
Data are mean ¢s.d.**P< 0.001 (two-tailed t-test). e, Parental Huh7 (left) and 
Hep3B (right) cells and the indicated clones with knock-in expression of 
PCK1(S90A) were stimulated with or without IGF1 (100 ng mI) for 16h. 
Immunoblotting analyses were performed as indicated. f, Parental Huh7 cells 
and the indicated clones with knock-in expression of INSIG1(S207A)/ 
INSIG2(S151A) double mutants (left) or PCK1(S90A) (right) were stimulated 
with or without IGF1 (100 ng mI) for 16 h. The mRNA expression levels for 
SREBF1 were measured using quantitative PCR (n=6). Dataare mean +s.d. 
*P=0.002,**P< 0.001 (two-tailed t-test). g, Parental Huh7 cells andthe 
indicated clones with knock-in expression of PCK1(S90A) were stimulated with 
or without IGF1 (100 ng mI’) for 16 h. The incorporation of “C-glucose into 
triglycerides (left) and fatty acids (right) was measured (n=6). Data are 

mean +s.d.and were compared between groups using a two-tailed ¢-test. 
*P=0.014, 0.012 (left to right); **P = 0.004, 0.001 (left to right). h, Endogenous 
PCK1-depleted CHL-1 human melanoma cells, U87 human glioblastoma cells 
and H1993 human non-small-cell lung cancer cells with reconstituted 
expression of shRNA-resistant wild-type HA-rPCK1 or HA-rPCK1(S90A) were 
stimulated with or without IGF1 (100 ng mI) for 16 h. The incorporation of 
4C-glucose into fatty acids was measured (n= 6). Dataare mean +s.d. and were 


compared between groups using a two-tailed ¢-test. *P= 0.011, **P< 0.001. 

i,j, Parental Huh7 or Hep3B cells and the indicated clones with expression of 
PCK1(S90D) (i) or INSIG1(S207D)/INSIG2(S151D) double mutants (2S/D) (j) 
were collected for immunoblotting analyses as indicated. k, l, Parental Huh7 
cells and the indicated clones with knock-in expression of PCK1(S90A) (k) or 
INSIG1(S207A)/INSIG2(S151A) (I) were stimulated with or without IGF1 

(100 ng mI) for 16 h. Immunoblotting analyses were performed as indicated. 
m,n, Parental Huh7 cells and the indicated clones with knock-in expression of 
PCK1(S90A) (m) or INSIG1(S207A)/INSIG2(S151A) (n) were stimulated with or 
without IGF1 (100 ng mI’) for 16 h. The mRNA expression levels for SREBP2 
target genes were measured using quantitative PCR (n=6). Dataare mean +s.d. 
**P< 0.001 (two-tailed ¢-test). o, p, Parental Huh7 cells and the indicated clones 
with knock-in expression of INSIGI(S207A)/INSIG2(S151A) double mutants 
were incubated with a lipid-depleted medium and treated with lovastatin (an 
inhibitor of HMGCR)to inhibit cholesterol synthesis for 16 h. The cells were 
subjected to homogenization and cell fractionation using gradient 
centrifugation. Immunoblotting analyses were performed as indicated (0). 
The relative distribution of each protein in different fractions was quantified 
by densitometric analysis of the blots (n=3) (p). Dataare mean+s.d.q,r, 
Parental Huh7 cells and the indicated clones with knock-in expression of 
PCK1(S90A) were incubated with or without lipid-depleted medium for 16h. 
The cells were subjected to homogenization and cell fractionation using 
gradient centrifugation. Immunoblotting analyses were performed as 
indicated (q). The relative distribution of each protein in different fractions 
was quantified by densitometric analysis of the blots (n= 3) (r). Dataare 

mean +s.d.s-x, Parental Huh7 cells and theindicated clones with knock-in 
expression of INSIG1(S207A)/INSIG2(S151A) double mutants (s, v) or 
PCK1(S90A) (t, w) were incubated with or without lipid-depleted medium for 
16h. Immunofluorescence analyses were performed as indicated. Scale bars, 
20 um. The white arrows indicate the Golgi-localized SCAP. The colocalization 
coefficients between SCAP and the ER marker calnexin (u) and the Golgi 
apparatus marker golgin-97 (x) are shown. At least n=50 cells from each 
independent experiment were analysed and representative data are shown. 
Data are mean+s.d. and were compared between groups using a two-tailed 
t-test. NS, not significant (P= 0.788, 0.514, 0.689, 0.693 (left to right) (u); 
P=0.976, 0.606, 0.750, 0.940 (left to right) (x)). Allexperiments were repeated 
three times independently with similar results. 
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Extended Data Fig. 7 | PCK1-mediated phosphorylation of INSIG1/2, which 
does not affect lipid-depletion-induced activity of SREBP1, promotes rapid 
activation of SREBP1 and degradation of INSIG1. a, Parental Huh7 cells andthe 
indicated clones of Huh7 cells with knock-in expression of INSIG1(S207A)/ 
INSIG2(S151A) double mutants (top) or knock-in expression of PCK1(S90A) 
(bottom) were incubated with or without lipid-depleted medium for 16h. 
Immunoblotting analyses were performed with the indicated antibodies. 

b, Parental Huh7 cells and the indicated clones of Huh7 cells with knock-in 
expression of NSIG1(S207A)/INSIG2(S151A) double mutants (left) or knock-in 
expression of PCK1(S90A) (right) were incubated with or without lipid-depleted 
medium for 16h. Immunofluorescence analyses were performed with the 
indicated antibodies. c, Parental Huh7 cells and the indicated clones of Huh7 
cells with knock-in expression of INSIG1(S207A)/INSIG2(S151A) double mutants 
(left) or knock-in expression of PCK1(S90A) (right) were transiently transfected 
with vectors expressing B-galactosidase and an SRE-driven luciferase reporter. 
Luciferase activity was determined after the cells were incubated with or 
without lipid-depleted medium for 16h. The relative SRE luciferase activity after 
normalization to B-galactosidase activity is shown (n=6). NS, not significant 
(P=0.965, 0.699, 0.967, 0.767 (left to right)). d, Huh7 cells expressing Flag— 
INSIG1 and His-INSIG2 were stimulated with or without IGF1 (100 ng mI’) for 

16 hinthe presence of 40 nM or120 nM cholesterol or 25-hydroxycholesterol. 
Ni-NTA pull-down assays, immunoprecipitation and immunoblotting analyses 
were performed with the indicated antibodies. 25-HC, 25-hydroxycholesterol. 
e, Parental Huh7 cells and the indicated clones of Huh7 cells with knock-in 
expression of both INSIG1(S207D) and INSIG2(S151D) (left) or knock-in 
expression of PCK1(S90D) (right) were incubated with 40 nM or120nM 
cholesterol or 25-hydroxycholesterol for 16h. Immunoprecipitation and 
immunoblotting analyses were performed with the indicated antibodies. 

f, Huh7 cells were transiently transfected with vectors expressing 
B-galactosidase and an SRE-driven luciferase reporter. Twenty-four hours later, 
the cells were stimulated with or without IGF1 (100 ng mI) for 16 hinthe 
presence of 40nM or 120 nM cholesterol or 25-hydroxycholesterol. The relative 
SRE luciferase activity after normalization to B-galactosidase activity is shown 
(n=6).*P=0.001, **P< 0.001 (two-tailed t-test); NS, not significant (P=0.921, 
0.579 (left to right)). g, Parental Huh7 cells and the indicated clones of Huh7 cells 
with knock-in expression of INSIG1(S207D)/INSIG2(S151D) double mutants (left) 


or knock-in expression of PCK1(S90D) (right) were transiently transfected with 
vectors expressing -galactosidase and an SRE-driven luciferase reporter. 
Luciferase activity was determined after the cells were incubated with 40 nM or 
120 nM cholesterol or 25-hydroxycholesterol for 16h. The relative SRE luciferase 
activity after normalization to B-galactosidase activity is shown (n= 6). 
*P=0.002, **P<0.001 (two-tailed t-test); NS, not significant (P= 0.642, 0.957, 
0.842, 0.372 (left to right)). h, Huh7 cells were treated with IGF1 (100 ng mI) for 
the indicated time periods. Immunoprecipitation and immunoblotting 
analyses were performed with the indicated antibodies. i, Parental Huh7 cells 
and the indicated clones of Huh7 cells with knock-in expression of PCK1(S90A) 
(left) or INSIGI(S207A)/INSIG2(S151A) double mutants (right) were treated with 
or without IGF1 (100 ng mI’) for 48 h. Immunoblotting analyses were performed 
with the indicated antibodies.j, k, Serum-starved parental Huh7 (j) and Hep3B 
(k) cells or the indicated clones with knock-in expression of INSIGI(S207A)/ 
INSIG2(S151A) double mutants were stimulated with or without IGF1 

(100 ng mI’) inthe presence or absence of CHX (100 pg mI’) for the indicated 
time periods. Immunoblotting analyses were performed with the indicated 
antibodies (left). The relative abundance of remaining INSIG1 or INSIG2 protein 
was quantified (right) (n =6).**P< 0.001 (two-tailed ¢test); NS, not significant 
(P=0.721, 0.637 (left to right) (j); P= 0.310, 0.853 (left to right) (k)).1, Huh7 cells 
expressing Flag-tagged wild-type INSIG1 (top) or INSIG1(S207D) (bottom) were 
treated with CHX (100 pg mI”) for the indicated time periods. Immunoblotting 
analyses were performed with the indicated antibodies (left). The relative 
abundance of the remaining INSIGI1 protein was quantified (right) (n= 6). 

**P< 0.001 (two-tailed ttest).m, Huh7 cells expressing Flag-tagged wild-type 
INSIG2 (top) or INSIG2(S151D) (bottom) were treated with CHX (100 pg mI) for 
the indicated time periods. Immunoblotting analyses were performed with the 
indicated antibodies (left). The relative abundance of the remaining INSIG2 
protein was quantified (right) (n=6).**P< 0.001 (two-tailed t-test); NS, not 
significant (P=0.395, 0.973, 0.636, 0.882 (left to right)).n, Endogenous 
PCK1-depleted Huh7 cells with reconstituted expression of shRNA-resistant 
wild-type rPCK1 or rPCK1(S90A) were treated with IGF1 (100 ng mI’) for the 
indicated time periods. Immunoprecipitation and immunoblotting analyses 
were performed with the indicated antibodies. Datainc,f,g,j-maremean+s.d. 
All experiments were repeated three times independently with similar results. 
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Extended Data Fig. 8 | PCK1-mediated phosphorylation of INSIG1 Ser207 
and INSIG2 Ser151 and SREBP1 activation are induced by oncogene- or 
growth-factor-mediated activation of AKT inseveral cancer types. a,b, e-l, 
Cells were transfected with the indicated plasmids and immunoprecipitation 
or immunoblotting analyses were performed with the indicated antibodies. 

a, Endogenous PCK1-depleted HL7702 (left) and THLE-2 (right) cells with 
reconstituted expression of shRNA resistant wild-type HA-rPCK1 or 
HA-rPCK1(S90A) were treated with or without IGF1 (100 ng ml) for 16h. 

b, Endogenous PCK1-depleted HL7702 (left) and THLE-2 (right) cells with 
reconstituted expression of shRNA resistant wild-type HA-rPCK1 or HA- 
rPCK1(S90D) were analysed. c, C57BL/6J male mice (eight-week-old) fasted for 
24 hwere intraperitoneally injected with or without glucose (1g per kg body 
weight). After 6h, the mouse livers were dissected forimmunoprecipitation 
and immunoblotting analyses. d, HL7702, THLE-2, Huh7 and Hep3B cells were 
analysed by Ni-NTA pull-down, immunoprecipitation and immunoblotting 
assays. e, Endogenous PCK1-depleted HL7702, THLE-2, Huh7 and Hep3B cells 
with reconstituted expression of shRNA resistant wild-type HA-rPCK1or 
HA-rPCK1(S90A) were analysed. f, Parental Huh7 cells and the indicated clones 
with knock-in expression of PCK1(S90A) (left) or INSIG1(S207A)/INSIG2(S151A) 
double mutants (right) were transfected with or without Flag-KRAS(G12V). 

g, Endogenous PCK1-depleted CHL-1, U87 and H1993 cells with reconstituted 
expression of shRNA resistant wild-type HA-rPCK1 or HA-rPCK1(S90A) were 
transfected with the indicated plasmids. h, Parental Huh7 cells and the 


indicated clones with knock-in expression of PCK1(S90A) (left) or 
INSIG1(S207A)/INSIG2(S151A) double mutants (right) were transfected with or 
without constitutively active Flag-IGF1IR(V922E). i, Endogenous 
PCK1-depleted CHL-1, U87 and H1993 cells with reconstituted expression of 
shRNA resistant wild-type HA-rPCK1 or HA-rPCK1(S90A) were transfected 
with the indicated plasmids. j, Endogenous PCK1-depleted CHL-1, U87 and 
H1993 cells with reconstituted expression of shRNA resistant wild-type HA- 
rPCK1 or HA-rPCK1(S90A) were transfected with the indicated plasmids and 
treated with or without PDGF (30 ng mI’) for 16h. k,m, Endogenous 
PCK1-depleted Huh7, Hep3B, SNU-398 and SNU-475 cells with reconstituted 
expression of shRNA resistant wild-type HA-rPCK1 or HA-rPCK1(S90A) were 
analysed by immunoprecipitation and immunoblotting analyses with the 
indicated antibodies (k). The cells were plated and then collected and counted 
for 3 days (n= 6) (m). Dataare mean+s.d.**P< 0.001 (two-tailed t-test). 

In, Endogenous PCK1-depleted Huh7, Hep3B, SNU-398 and SNU-475 cells with 
reconstituted expression of shRNA resistant wild-type HA-rPCK1 or HA- 
rPCK1(S90A) were transfected with Flag-INSIG1 and His-INSIG2. After 
incubation with or without lipid-depleted medium for 16 h, the cells were 
collected for Ni-NTA pull-down, immunoprecipitation and immunoblotting 
analyses as indicated (I). Viable cells were counted for 3 days after lipid 
depletion (n=6) (n). Dataare mean +s.d.NS, not significant (P= 0.708, 0.619, 
0.888, 0.901, 0.633, 0.788, 0.902, 0.764 (left to right)). Allexperiments were 
repeated three times independently with similar results. 
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Extended Data Fig. 9 | PCK1-mediated phosphorylation of INSIG1/2 
promotes the proliferation ofliver cancer cellsandthe growthoftumoursin 
mice. a, Parental Hep3B cells (2 x 10°) and the indicated clones of Hep3B cells 
(210°) with knock-in expression of PCK1(S90A) (left) or INSIG1(S207A)/ 
INSIG2(S151A) double mutants (right) were plated for 3 days. The cells were 
then collected and counted. Data are mean +s.d. (n=6).**P< 0.001 (two-tailed 
t-test). b, c, Parental Huh7 and Hep3B cells (2 x 10°) and the indicated clones of 
Huh7 and Hep3B cells with knock-in expression of PCK1(S90D) (b) or 
INSIG1(S207D)/INSIG2(S151D) double mutants (c) were plated for 3 days. The 
cells were then collected and counted. Data are mean +s.d. (n=6).**P<0.001 
(two-tailed t-test). d-h, Parental Huh7 cells (1 x 10°) or the clones with knock-in 
expression of PCK1(S90A) or INSIGI(S207A)/INSIG2(S151A) double mutants 
were subcutaneously injected into the left or right flanks of athymic nude mice, 
respectively (n=7 per group) (d, left). The resulting tumours were resected 28 
days after injection (d, right). The growth of xenografted tumours in the mice 
was measured (e) and the tumours were weighed (f). Dataare mean +s.d. (n=7). 
*P=0.002, **P< 0.001 compared with the wild-type group (two-tailed t-test). 

g, IHC analyses of tumour samples were performed with an anti-Ki67 antibody. 
Ki67-positive cells were quantified (right). h, TUNEL analyses of the indicated 
tumour samples were performed. Apoptotic cells were stained brown and 
quantified inn=10 microscopic fields (right). Data are mean +s.d.**P<0.001 


compared with the wild-type group (two-tailed test). i,j, Huh7 cells (1x 10°) 
with or without knock-in expression of PCK1(S90A) or INSIG1(S207A)/ 
INSIG2(S151A) double mutants were intrahepatically injected into athymic 
nude mice (n=7 per group). At 28 days after injection, the mice were 
euthanized and the liver tumours were dissected for immunoprecipitation, 
immunoblotting (i) and immunofluorescence (j) analyses with the indicated 
antibodies. k, Huh7 cells (1 x 10°) with or without knock-in expression of 
PCK1(S90D) (top) or INSIG1(S207D)/INSIG2(S151D) double mutants (bottom) 
were intrahepatically injected into athymic nude mice (n =7 per group). The 
mice were euthanized and examined for tumour growth 22 days after injection. 
The arrows indicate tumours. Tumour volumes were calculated (right). Data are 
mean ts.d. (n=7).**P< 0.001 compared with the wild-type group (two-tailed 
t-test). I,m, Huh7 cells (1x 10°) and Huh7 cells with knock-in expression of 
PCK1(S90D) (1) or INSIGI(S207D)/INSIG2(S151D) double mutants (m) were 
subcutaneously injected into the left or right flanks of athymic nude mice, 
respectively (n=7 per group). The resulting tumors were resected 22 days after 
injection (left). The growth of xenografted tumours in the mice was measured 
(middle) and the tumours were weighed (right). Data are mean+s.d.(n=7). 
**P< 0.001 compared with the wild-type group (two-tailed t-test). All 
experiments were repeated three times independently with similar results. 
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Extended Data Fig. 10 | Activation of the IGFIR-AKT-PCK1-INSIG1/2 
signalling cascade is required for the growth of liver tumours and correlates 
with poor prognosis for HCC. a, b, Huh7 cells (1 x 10°) with or without knock-in 
expression of PCK1(S90A) (a) or INSIG1(S207A)/INSIG2(S151A) double mutants 
(b) were stably transfected with or without Flag-IGFIR(V922E) and 
intrahepatically injected into athymic nude mice (n=7 per group). The mice were 
euthanized and examined for tumour growth 28 days after injection. The arrows 
indicate tumours. Tumour volumes were calculated (right). Dataare mean+s.d. 
(n=7).*P=0.011 (a), 0.013 (b), **P<0.001 (two-tailed t-test). c, IHC analyses of 
xenografted tumours from nude mice were performed with the indicated 
antibodies. The regions in white boxes are shown at higher magnification below. 
d,e, Huh7 cells (1 x 10°) with or without knock-in expression of PCK1(S90A) (d) or 
INSIG1(S207A)/INSIG2(S151A) double mutants (e) were stably transfected with or 
without Flag-myr-AKT1and intrahepatically injected into athymic nude mice 
(n=7 per group). The mice were euthanized and examined for tumour growth 28 
days after injection. The arrows indicate tumours. Tumour volumes were 
calculated (right). Data are mean +s.d. (n=7).*P=0.015 (d), 0.010 (e), **P< 0.001 
(two-tailed t-test). f, IHC analyses of xenografted tumours from nude mice were 
performed with the indicated antibodies. The regions in white boxes are shown 
at higher magnification below. g, h, Huh7 cells (1x 10°) with or without knock-in 
expression of PCK1(S90D) (g) or INSIG1(S207D)/INSIG2(S151D) double mutants 
(h) were stably transfected with Flag-IGFIR(L1003R) and intrahepatically 
injected into athymic nude mice (n=7 per group). The mice were euthanized and 
examined for tumour growth 28 days after injection. The arrows indicate 
tumours. Tumour volumes were calculated (right). Data are mean +s.d.(n=7). 
*P=0.003(g), 0.005 (h), **P< 0.001 (two-tailed test). i-n, Huh7 cells (1x 10°) 
with or without knock-in expression of PCK1(S90D) (i-k) or INSIG1(S207D)/ 
INSIG2(S151D) double mutants (I-n) were stably transfected with or without 
Flag-IGF1R(L1003R) and subcutaneously injected into the left flanks of athymic 
nude mice (n=7 per group). The resulting tumours were resected 28 days after 
injection (i, 1). The growth of xenografted tumours in the mice was measured (j, 
m). The tumours were weighed (k,n). Dataare mean +s.d.(n=7).*P=0.002 (j), 
0.011 (k), 0.009 (m), 0.016 (n), **P< 0.001 compared with the PCK1 

wild-type +IGF1R(L1003R) group (j,k) or compared with the INSIG1/2 

wild-type +IGF1R(L1003R) group (m, n) (two-tailed test). o-q, Huh7 cells 


(1x 10°) with or without knock-in expression of PCK1(S90D) or INSIGI(S207D)/ 
INSIG2(S151D) double mutants were stably transfected with or without Flag— 
IGF1R(L1003R) and intrahepatically injected into athymic nude mice (n=7 per 
group). The mice were euthanized 28 days after injection. IHC analyses of 
xenografted tumours from nude mice were performed with the indicated 
antibodies. The regions in white boxes are shown at higher magnification below. 
r-t, Wild-type pT3-EFla-Flag-PCK1 (or pT3-EFla-Flag-PCK1(S90A)), 
pT3-EFla-HA-myr-AKT1and pT3-EFla-V5-c-Met, along with the sleeping beauty 
transposase (SB), were stably expressed in the mouse liver using hydrodynamic 
transfection into FVB/N mice (n=7 per group). After 14 weeks, the mice were 
euthanized and representative liver tumours are shown (r, left). The average 
tumour volumes were measured (r, right) (n=7 per group). **P< 0.001 (two-tailed 
t-test).s, IHC analyses of tumour samples were performed with an anti-Ki67 
antibody (left). Ki67-positive cells were quantified in 10 microscopic fields 
(right). **P< 0.001 (two-tailed t-test). t, IHC analyses of liver tumours from nude 
mice were performed with the indicated antibodies. The regions in white boxes 
are shownat higher magnification below. u, IHC staining of 30 human HCC and 
matched non-tumour tissue samples was performed with the indicated 
antibodies. Representative images of two cases are shown. The regions in white 
boxes are shown at higher magnification below. v, Representative 
immunoblotting analyses of two cases of human HCC and matched non-tumour 
tissue samples was performed with the indicated antibodies. w, The indicated 
staining scores for PCK1(pS90), INSIG1(pS207)/INSIG2(pS151) and SREBP1 
expression levels in HCC and matched non-tumour liver samples (n=30) were 
compared using a paired t-test (two-tailed). Data are mean+s.d.**P<0.001 
compared with the non-tumour adjacent tissue. x, Representative 
immunoblotting analyses of four different human HCC samples were performed 
with the indicated antibodies. y, IHC staining of human HCC samples with the 
indicated antibodies was scored, and correlation analyses were performed. 
APearson correlation test was used (two-tailed) (n=40). Note that the scores for 
some samples overlap. z, Mechanism for PCK1-promoted activation of SREBP1 
and lipogenesis. PEP, phosphoenolpyruvate; OAA, oxaloacetate; HC, 
hydroxycholesterol; bHLH, basic helix-loop-helix protein; Reg, regulatory 
domain of SREBP. All experiments were repeated three times independently 
with similar results. 


natureresearch rer 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistical parameters 


When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
text, or Methods section). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


[| A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


[| For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Clearly defined error bars 
State explicitly what error bars represent (e.g. SD, SE, Cl) 


Our web collection on statistics for biologists may be useful. 


Software and code 


Policy information about availability of computer code 


Data collection Image J (version 1.8.0) has been used for the quantification of immunoblotting and Immunofluorescence staining analyses . FV10-ASW 
Viewer software(Version 4.2b) is used to collect the immunofluorescence data. Mascot software program and Proteome Discoverer 
software program (Version 2.2) has been used for the proteomics studies. 


Data analysis Excel (2007) has been used for the two-sided T-test. Realtime PCR data: ABI 7500 soft ware (version 2.3). GraphPad Prism (V.8.1.1) has 
been used for patients’ survival analyses. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request. 


Field-specific reporting 


Please select the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x] Life sciences Behavioural & social sciences [| Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/authors/policies/ReportingSummary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size The group sizes of the animals chosen are based on the numbers we used for previous publications, which is most optimal to generate 
statistically significant results. Triplicated experiments were conducted so that statistical significance can be tested. 


Data exclusions No data were excluded. 
Replication Experiments were repeated in triplication. All replication were successful. 
Randomization | The samples/cells for IHC studies were randomized to be examined ( No specific methods were used). 


Blinding Blinding is not relevant because all experiment groups were conducted with different treatments. 


Behavioural & social sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description Briefly describe the study type including whether data are quantitative, qualitative, or mixed-methods (e.g. qualitative cross-sectional, 
quantitative experimental, mixed-methods case study). 


Research sample State the research sample (e.g. Harvard university undergraduates, villagers in rural India) and provide relevant demographic information 
(e.g. age, sex) and indicate whether the sample is representative. Provide a rationale for the study sample chosen. For studies involving 
existing datasets, please describe the dataset and source. 


Sampling strategy Describe the sampling procedure (e.g. random, snowball, stratified, convenience). Describe the statistical methods that were used to 
predetermine sample size OR if no sample-size calculation was performed, describe how sample sizes were chosen and provide a rationale 
for why these sample sizes are sufficient. For qualitative data, please indicate whether data saturation was considered, and what criteria 
were used to decide that no further sampling was needed. 


Data collection Provide details about the data collection procedure, including the instruments or devices used to record the data (e.g. pen and paper, 


computer, eye tracker, video or audio equipment) whether anyone was present besides the participant(s) and the researcher, and whether 


the researcher was blind to experimental condition and/or the study hypothesis during data collection. 
Timing Indicate the start and stop dates of data collection. If there is a gap between collection periods, state the dates for each sample cohort. 


Data exclusions If no data were excluded from the analyses, state so OR if data were excluded, provide the exact number of exclusions and the rationale 
behind them, indicating whether exclusion criteria were pre-established. 


Non-participation State how many participants dropped out/declined participation and the reason(s) given OR provide response rate OR state that no 
participants dropped out/declined participation. 


Randomization If participants were not allocated into experimental groups, state so OR describe how participants were allocated to groups, and if 
allocation was not random, describe how covariates were controlled. 
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Ecological, evolutionary & environmental sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description 


Research sample 


Sampling strategy 


Data collection 


Timing and spatial scale 


Data exclusions 


Reproducibility 


Randomization 


Briefly describe the study. For quantitative data include treatment factors and interactions, design structure (e.g. factorial, nested, 
hierarchical), nature and number of experimental units and replicates. 


Describe the research sample (e.g. a group of tagged Passer domesticus, all Stenocereus thurberi within Organ Pipe Cactus National 
Monument), and provide a rationale for the sample choice. When relevant, describe the organism taxa, source, sex, age range and 
any manipulations. State what population the sample is meant to represent when applicable. For studies involving existing datasets, 
describe the data and its source. 


Note the sampling procedure. Describe the statistical methods that were used to predetermine sample size OR if no sample-size 
calculation was performed, describe how sample sizes were chosen and provide a rationale for why these sample sizes are sufficient. 


Describe the data collection procedure, including who recorded the data and how. 
Indicate the start and stop dates of data collection, noting the frequency and periodicity of sampling and providing a rationale for 
these choices. If there is a gap between collection periods, state the dates for each sample cohort. Specify the spatial scale from which 


the data are taken 


If no data were excluded from the analyses, state so OR if data were excluded, describe the exclusions and the rationale behind them, 
indicating whether exclusion criteria were pre-established. 


Describe the measures taken to verify the reproducibility of experimental findings. For each experiment, note whether any attempts to 
repeat the experiment failed OR state that all attempts to repeat the experiment were successful. 


Describe how samples/organisms/participants were allocated into groups. If allocation was not random, describe how covariates were 
controlled. If this is not relevant to your study, explain why. 


Blinding Describe the extent of blinding used during data acquisition and analysis. If blinding was not possible, describe why OR explain why 
blinding was not relevant to your study. 
Did the study involve field work? Yes [ ] No 


Field work, collection and transport 


Field conditions 


Location 


Access and import/export 


Disturbance 


Describe the study conditions for field work, providing relevant parameters (e.g. temperature, rainfall). 


State the location of the sampling or experiment, providing relevant parameters (e.g. latitude and longitude, elevation, water 
depth). 


Describe the efforts you have made to access habitats and to collect and import/export your samples in a responsible manner and 
in compliance with local, national and international laws, noting any permits that were obtained (give the name of the issuing 


authority, the date of issue, and any identifying information). 


Describe any disturbance caused by the study and how it was minimized. 


Reporting for specific materials, systems and methods 


aterials & experimental systems 


ethods 
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n/a | Involved in the study n/a | Involved in the study 


ChIP-seq 


Unique biological materials 


Antibodies Flow cytometry 


Eukaryotic cell lines MRI-based neuroimaging 


Palaeontology 


Animals and other organisms 


Human research participants 


Unique biological materials 


Policy information about availability of materials 


Obtaining unique materials 


Antibodies 


Antibodies used WB: GST (Santa Cruz Biotechnology, sc-138, B-14, 1:2000) 
WB: Tubulin (Santa Cruz Biotechnology, sc-8035, TU-02, 1:1000), 
WB: ERK1/2 (Santa Cruz Biotechnology, sc-514302, C-9, 1:2000) 
WB and IF: Insig1 (Proteintech, 22115-1-AP, Ag17420, WB: 1:1000; IF: 1:200), 
WB: Insig2 antibody (Thermo Fisher, PA5-41707, Rabbit polyclonal, WB: 1 :1000), 
WB: Insig1 antibody (Thermo Fisher, PA5-97876, Rabbit polyclonal, WB: 1 :1000), 
WB: AKT pS473 (Cell signaling, #4060, D9E, WB: 1:1000) 
WB: AKT (Cell signaling, #9272, Rabbit Polyclonal, WB: 1:1000) 
WB: SRC (Cell signaling, #2108, Rabbit Polyclonal, WB: 1:1000) 
WB: Phospho-p44/42 MAPK (Erk1/2) (Thr202/Tyr204) (Cell signaling, #9101, Rabbit, WB: 1:1000) 
WB: c-Jun (Cell signaling, #9165, 60A8, WB: 1:1000) 
WB: c-Jun pS73 (Cell signaling, #3270, D47G9, WB: 1:1000) 
WB: PCK1 (Cell signaling, #12940, D12F5, WB: 1:1000) 
WB: Flag (Sigma-Aldrich, F1804, M2, Mouse Monoclonal, WB: 1:5000) 
WB: Flag (Sigma-Aldrich, F7425, Rabbit Polyclonal, WB: 1:5000) 
WB: His (Sigma-Aldrich, SAB1305538, 6AT18, WB: 1:5000) 
WB: Src (phospho Y418) (ABCAM, ab4816, Rabbit polyclonal, WB: 1:1000) 
WB and IP: Insig1 (Santa Cruz Biotechnology, sc-390504, Mouse Monoclonal, A-9, WB: 1:2000; IP: 500 ug/ 0.25 ml agarose in 1 
ml) 
WB and 
WB and 
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: PCK1 (Proteintech, 16754-1-AP, Ag10261, WB: 1:1000; IP: 0.5-4.0 ug) 

: Insig2 (Proteintech, 24766-1-AP, Ag14072, WB: 1:1000; IP: 0.5-4.0 ug) 

WB and yc tag (ABCAM, ab9106, Rabbit polyclonal, WB: 1:2000, IP:1:500) 

WB and IP: HA (Cell signaling, #3724, C29F4, WB: 1:1000, IP: 1:250) 

WB, IP and IF: Myc-tag mouse (Cell signaling, #2276, 9B11, WB: 1:2000, IP:1:500, IF: 1:500) 
WB and IF: PCK1 (Novus, HO0005105-M11, 3E4, Mouse Monoclonal, WB: 1:1000; IF: 1:200) 
WB and IF: Calnexin (ABCAM, ab22595, Rabbit polyclonal, WB: 1:1000; IF:1:200) 

WB and IF: Calnexin (Thermo Fisher, MA3-027, AF18, Mouse Monoclonal, WB: 1:1000; IF:1:200) 
WB and IF: SCAP antibody (ABCAM, ab190103, Rabbit polyclonal, WB: 1:1000; IF: 1:300) 
WB: Golgi 97 (ABCAM, ab84340, Rabbit polyclonal, WB: 1:1000) 

WB: SCD1 (ABCAM, ab19862, CD.E10, Mouse Monoclonal, WB: 1:1000) 

WB: FASN (ABCAM, ab22759, Rabbit polyclonal, WB: 1:1000) 

WB: SREBP1 (BD Biosciences, 557036, IgG 2A4, WB: 1:1000) 

WB: SREBP2 (BD Biosciences, 557037, IgG-1C6, WB: 1:1000) 

WB: anti-Rabbit IgG heavy chain (HRP) (ABCAM, ab99702, 2A9, Mouse, WB: 1:5000) 

WB: Rabbit IgG-HRP (Thermo Fisher, 31458, polyclonal, 1:5000) 

WB: Mouse IgG-HRP (Thermo Fisher, 31430, polyclonal, 1:5000) 

WB and IHC: PCK1 pS90 (Signalway, #58006, Rabbit polyclonal, WB: 1:1000, IHC: 1:100) 
WB and IHC: Insig1/2 pS207/S151 (Signalway, #58005, Rabbit polyclonal, WB: 1:1000, IHC: 1:100) 
P: AKT (Cell signaling, #2920, 40D4, Mouse Monoclonal, IP: 1:100) 

P: Normal mouse IgG (Santa Cruz Biotechnology, sc-2025, IP: 0.5-4.0 ug, control) 

P: Normal rabbit IgG (Santa Cruz Biotechnology, sc-2027, IP: 0.5-4.0 ug, control) 

F: Golgi 97 (Thermo Fisher, A-21270, CDF4, Mouse Monoclonal, IF: 1:200) 

F and IHC: SREBP1 antibody (Novus, NB100-2215, 2A4, IF: 1:200; IHC: 1:200) 

F: Rabbit IgG-Alexa Fleor 488 (Invitrogen, A11008, 34732A, Polyclonal, 1:500) 

F: Rabbit IgG-Alexa Fleor 594 (Invitrogen, A11012, 1810936, Polyclonal, 1:500) 

F: Mouse IgG-Alexa Fleor 488 (Invitrogen, A11029, 673781, Polyclonal, 1:500) 

F: Mouse IgG-Alexa Fleor 594 (Invitrogen, A11005, 610868, Polyclonal, 1:500) 

HC: Ki67 (Sigma-Aldrich, #AB9260, Rabbit polyclonal, IHC: 1:200) 


p 
p 
p 
p 


1 


T 


Validation Antibodies were only used for the application as indicated and organisms verified by the manufactures. Besides, antibodies 
against PCK1 S90 phosphorylation and Insig1/2 $207/S151 phosphorylation were also double-validated for immunoblotting and 
IHC analyses by competitive blocking assay with PCK1 pS90 (Extended Data Fig. 2h) and Insig1/2 pS207/S151 peptides (Extended 
Data Fig. 3j-k) . 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) Hep3B, Huh7, H1993, CHL-1, SNU-398, SNU-475, HL7702, THLE-2 and 293T cells were from ATCC. The U251 and U87 GBM 
cells used in the experiments were authenticated using short tandem repeat profiling at The University of Texas MD 


Anderson Cancer Center. 


Authentication Cells were authenticated using short tandem repeat profiling at The University of Texas MD Anderson Cancer Center 
(Houston, Texas) 


Mycoplasma contamination All cells lines are confirmed without Mycoplasma contamination. 


Commonly misidentified lines No commonly misidentified lines were used 
(See ICLAC register) 


Palaeontology 


Specimen provenance Provide provenance information for specimens and describe permits that were obtained for the 


Specimen deposition 
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Dating methods 


Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Female, 4-week-old BALB/c athymic nude mice and Wild-type FVB/N mice were used. The use of the animals was approved by 
the Institutional Review Board at MD Anderson Cancer Center and the Institutional Animal Care and Use Committee (IACUC) of 
Zhejiang University. 


Wild animals This study did not involve wild animals 


Field-collected samples No sample was collected from the field. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Age and Gender are randomly distributed in different groups. Sample numbers are large enough for statistical analysis. 
Human HCC and adjacent matched nontumor tissue samples from 90 patients (EHBH cohort) were obtained from Eastern 
Hepatobiliary Surgery Hospital in Shanghai, China. The use of human HCC samples and the relevant database was approved by 
the Eastern Hepatobiliary Surgery Hospital Research Ethics Committee and complied with all relevant ethical regulations. All 
tissue samples were collected in compliance with informed consent policy. A total of 90 patients who undertook surgical 
resection in Eastern Hepatobiliary Surgery Hospital from 2009 to 2011 were included in this study. 69 males and 21 females, 
with an average (+ SD) age of 46.7 (+ 10.3) years undertook surgical resection. All patients had received standard therapies after 
surgery. Sections of paraffin-embedded human HCC samples were stained with antibodies against AKT pS473, PCK1 pS90, Insig1 
pS207/Insig2 pS151, SREBP1, or nonspecific IgG as a negative control. The staining of the tissue sections was quantitatively 
scored according to the percentage of positive cells and staining intensity. Scores were compared with overall survival duration, 
defined as the time from date of diagnosis to that of death or last known follow-up examination. 


Recruitment participants were not recruited. 


ChIP-seq 


Data deposition 


Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links 
May remain private before publication. 


Files in database submission 


Genome browser session 
(e.g. UCSC) 


Methodology 


Replicates Describe the experimental replicates, specifying number, type and replicate agreement. 


Sequencing depth Describe the sequencing depth for each experiment, providing the total number of reads, uniquely mapped reads, length of 
reads and whether they were paired- or single-end. 


Antibodies Describe the antibodies used for the Ch/P-seq experiments; as applicable, provide supplier name, catalog number, clone 
name, and lot number. 


Peak calling parameters Specify the command line program and parameters used for read mapping and peak calling, including the ChIP, control and 
index files used. 


Data quality Describe the methods used to ensure data quality in full detail, including how many peaks are at FDR 5% and above 5-fold 
enrichment. 
Software Describe the software used to collect and analyze the ChIP-seq data. For custom code that has been deposited into a 


community repository, provide accession details. 
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Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a ‘group’ is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


ethodology 

Sample preparation Describe the sample preparation, detailing the biological source of the cells and any tissue processing steps used. 
Instrument Identify the instrument used for data collection, specifying make and model number. 

Software Describe the software used to collect and analyze the flow cytometry data. For custom code that has been deposited into a 


community repository, provide accession details. 


Cell population abundance __| Describe the abundance of the relevant cell populations within post-sort fractions, providing details on the purity of the samples 
and how it was determined. 


Gating strategy Describe the gating strategy used for all relevant experiments, specifying the preliminary FSC/SSC gates of the starting cell 
population, indicating where boundaries between "positive" and "negative" staining cell populations are defined. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 


Magnetic resonance imaging 


Experimental design 


Design type Indicate task or resting state; event-related or block design. 


Design specifications Specify the number of blocks, trials or experimental units per session and/or subject, and specify the length of each trial 
or block (if trials are blocked) and interval between trials. 


Behavioral performance measures State number and/or type of variables recorded (e.g. correct button press, response time) and what statistics were used 
to establish that the subjects were performing the task as expected (e.g. mean, range, and/or standard deviation across 


subjects). 
Acquisition 
Imaging type(s) Specify: functional, structural, diffusion, perfusion. 
Field strength Specify in Tesla 
Sequence & imaging parameters Specify the pulse sequence type (gradient echo, spin echo, etc.), imaging type (EPI, spiral, etc.), field of view, matrix size, 


slice thickness, orientation and TE/TR/flip angle. 


Area of acquisition State whether a whole brain scan was used OR define the area of acquisition, describing how the region was determined. 


Diffusion MRI Used Not used 


Preprocessing 


Preprocessing software Provide detail on software version and revision number and on specific parameters (model/functions, brain extraction, 
segmentation, smoothing kernel size, etc.). 


Normalization If data were normalized/standardized, describe the approach(es): specify linear or non-linear and define image types 
used for transformation OR indicate that data were not normalized and explain rationale for lack of normalization. 


Normalization template Describe the template used for normalization/transformation, specifying subject space or group standardized space (e.g. 
original Talairach, MNI305, ICBM152) OR indicate that the data were not normalized. 


Noise and artifact removal Describe your procedure(s) for artifact and structured noise removal, specifying motion parameters, tissue signals and 
physiological signals (heart rate, respiration). 


Volume censoring Define your software and/or method and criteria for volume censoring, and state the extent of such censoring. 
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Statistical modeling & inference 


Model type and settings Specify type (mass univariate, multivariate, RSA, predictive, etc.) and describe essential details of the model at the first 
and second levels (e.g. fixed, random or mixed effects; drift or auto-correlation). 


Effect(s) tested Define precise effect in terms of the task or stimulus conditions instead of psychological concepts and indicate whether 
ANOVA or factorial designs were used. 

Specify type of analysis: Whole brain ROI-based [| Both 

Statistic type for inference Specify voxel-wise or cluster-wise and report all relevant parameters for cluster-wise methods. 


(See Eklund et al. 2016) 


Correction Describe the type of correction and how it is obtained for multiple comparisons (e.g. FWE, FDR, permutation or Monte 
Carlo). 


Models & analysis 


n/a | Involved in the study 


Functional and/or effective connectivity 


Graph analysis 


Multivariate modeling or predictive analysis 


Functional and/or effective connectivity Report the measures of dependence used and the model details (e.g. Pearson correlation, partial 
correlation, mutual information). 


Graph analysis Report the dependent variable and connectivity measure, specifying weighted graph or binarized graph, 
subject- or group-level, and the global and/or node summaries used (e.g. clustering coefficient, efficiency, 
etc.). 


Multivariate modeling and predictive analysis Specify independent variables, features extraction and dimension reduction, model, training and evaluation 
metrics. 
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® Check for updates 


Separation of eukaryotic sister chromatids during the cell cycle is timed by the spindle 
assembly checkpoint (SAC) and ultimately triggered when separase cleaves 
cohesion-mediating cohesin’°. Silencing of the SAC during metaphase activates the 


ubiquitin ligase APC/C (anaphase-promoting complex, also known as the cyclosome) 
and results in the proteasomal destruction of the separase inhibitor securin’. In the 
absence of securin, mammalian chromosomes still segregate on schedule, but itis 
unclear how separase is regulated under these conditions**. Here we show that human 
shugoshin 2 (SGO2), an essential protector of meiotic cohesin with unknown 
functions in the soma®’, is turned into a separase inhibitor upon association with 
SAC-activated MAD2. SGO2-MAD2 can functionally replace securin and sequesters 
most separase in securin-knockout cells. Acute loss of securin and SGO2, but not of 
either protein individually, resulted in separase deregulation associated with 
premature cohesin cleavage and cytotoxicity. Similar to securin®”, SGO2is a 
competitive inhibitor that uses a pseudo-substrate sequence to block the active site of 
separase. APC/C-dependent ubiquitylation and action of the AAA-ATPase TRIP13 in 
conjunction with the MAD2-specific adaptor p31°™ liberate separase from SGO2- 
MAD2in vitro. The latter mechanism facilitates a considerable degree of sister chromatid 
separation in securin-knockout cells that lack APC/C activity. Thus, our results identify an 
unexpected function of SGO2 in mitotically dividing cells anda mechanism of separase 
regulation that is independent of securin but still supervised by the SAC. 


In all eukaryotic cells, anaphase is triggered when chromosomal 
cohesin is cleaved by the essential Cys-endopeptidase separase*””. 
To prevent the premature loss of sister chromatid cohesion, separase 
needs to be tightly controlled. Separase is competitively inhibited by 
association with securin for most of the cell cycle. Only in metaphase 
does the E3 anaphase-promoting complex or cyclosome (APC/C) 
mediate the degradation of securin via the ubiquitin-proteasome 
system, thereby activating separase”. The destruction of securin 
is timed by the SAC, which keeps the APC/C co-activator CDC20 
inactive until all kinetochores are properly attached to spindle 
microtubules’. 


SGO2 is a prominent interactor of separase 


Unexpectedly, securin is not essential in human cells or mice*>. This 
can partially be explained by CDK1-cyclin B1-dependent regulation 
of separase””. Mouse Cdc20 embryos arrest in metaphase with 
cohered chromosomes because they cannot degrade either cyclin 
Blorsecurin'®. Notably, double knockout of Cdc20 and Pttg1 (which 
encodes securin) resulted in arrest with separated sister chromatids; 
this defect was rescued by constitutive activation of the SAC’, which 
suggest that there is aSAC-dependent but securin-independent 
mechanism to control separase. Rather than being stimulated by 
the SAC, the binding of CDK1-cyclin B1 to separase is dampened by 


phosphorylation of cyclin B during early mitosis’. However, a link 
between the SAC and the cohesin protector shugoshin (SGO) had 
previously been identified in that human SGO2-similar to CDC20— 
is bound by the essential SAC component MAD2’. Notably, mouse 
SGO2 and separase interacted when co-expressed in Hek293 cells 
(Extended Data Fig. 1a; see, however, Extended Data Fig. 1b). These 
findings led to the idea that SAC-activated MAD2 could enable SGO2 
to bind and inhibit separase. Indeed, when endogenous human 
separase was isolated by immunoprecipitation (IP) from untrans- 
fected, prometaphase-arrested Hek293T or untransformed RPE-1 
cells, SGO2 (but not the related SGO1) and MAD2 co-purified, along 
with the known interactors securin and cyclin B1 (Fig. 1a, Extended 
Data Fig. 1c). In contrast to separase, SGO2 and MAD2 were unde- 
tectable in a securin IP and, vice versa, securin and cyclin B1 were 
absent from an SGO2 IP (Fig. 1a). With previous results”°, these 
data argue that three mutually exclusive complexes co-exist in 
human mitotic cells: separase-securin, separase-CDK1-cyclin B1 
and separase-SGO2-MAD2. Following immunodepletion of the 
three inhibitors from taxol-arrested Hek293T, HCT116 and HeLa-K 
cells, we quantified the relative amounts of associated separase. On 
average, 59, 35, and 6% of total separase was sequestered by securin, 
SGO2, and cyclin B1, respectively (Extended Data Fig. 1d). Notably, 
in mitotic PTTGI™“ cells, most separase (85%) was in complex with 
SGO2 (Fig. 1b). 
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Fig. 1| MAD2-dependent binding of human SGO2 toseparase. a, Endogenous 
proteins were immunoprecipitated (IP) from taxol-arrested Hek293T cells and 
analysed by immunoblotting. Mock, nonspecific IgG. Sep., separase; sec., 
securin; cyc.B1, cyclin B1. b, Lysates from taxol-arrested PTTGI“ cells were 
subjected to consecutive immunodepletions and analysed by immunoblotting. 
c, Transgenic Hek293 cells depleted of SGO2 or control-treated (Ctrl) were 
transfected to express siRNA-resistant SGO2 variants as indicated (transgen. 
SGO2; wild-type (WT), R153D or N58I), induced with doxycycline to express 
CDK1-cyclin B1-resistant, stabilized Myc-separase(S1126A), arrested in 


MAD2 enables SGO to bind separase 


To test whether binding of SGO2 to separase required MAD2, we 
used RNA interference (RNAi) to deplete cells of endogenous 
SGO2 and replaced it with small interfering RNA (siRNA)-resistant, 
transgene-encoded variants. Subsequent immunoprecipitation of 
co-expressed Myc-separase showed that the MAD2-binding-deficient 
SGO2(R153D) was unable to interact with separase, whereas SGO2(NS58I), 
which cannot interact with protein phosphatase 2A (PP2A)’, still bound 
to MAD2 and separase (Fig. 1c). The SAC-shugoshin link is conserved 
in Xenopus, with the difference that here SGO1 rather than SGO2 binds 
MAD2 and separase’ (Extended Data Fig. 2). Thus, the separase-shu- 
goshin interaction depends on active MAD2 and is conserved in ver- 
tebrates. 

The separase-SGO2-MAD2 complex was present in cells arrested 
in G1, early S, G2, and prometaphase (Fig. 1d) but not in taxol-treated 
HeLa-K cells that were driven into an anaphase-like state by SAC abro- 
gation with the aurora B kinase inhibitor ZM-447439 (Fig. le). This cell 
cycle distribution mirrored that of CDC20-MADz2, the formation of 
which in interphase requires MAD1-dependent MAD2 activation at 
nuclear pore complexes (NPCs)”. Consistently, separase-SGO2-MAD2, 
similar to CDC20-MAD2, became (almost) undetectable in unsyn- 
chronized HeLa-K cells depleted of MAD1 and/or the NPC component 
TPR (Fig. 1f). Thus, conformationally activated MAD2 enables SGO2 
to sequester separase through all of the cell cycle except for a short 
period of SAC inactivity during late mitosis. 


Separase deregulation upon loss of securin and SGO 

Overexpression of a CDK1-cyclin Bl-resistant and stabilized 
separase(S1126A) variant causes premature sister chromatid separa- 
tion (PCS) in Hek293T cells”. This PCS phenotype was aggravated 
by siRNA-mediated depletion of endogenous SGO2 and alleviated 


prometaphase, and then analysed by IP-immunoblotting (left) and 
chromosome spreading (right). Bars show mean of three independent 
experiments (dots). Cat., catalytic subunit. d, HeLa-K cells synchronized 

in the indicated cell cycle phases were subjected to IP-immunoblotting. 

e, Taxol-arrested HeLa-K cells were released with ZM-447439 and subjected to 
time-resolved IP-immunoblotting. f, Control, MAD1-depleted, or 
TPR-depleted, thymidine-arrested HeLa-K cells were analysed by IP- 
immunoblotting. a-tub., a-tubulin. 


by simultaneous slight overexpression of wild-type SGO2 from an 
siRNA-resistant transgene (Fig. 1c). SGO2(R153D) did not rescue PCS 
in this cellular assay, whereas transgenic SGO2(N531) remained func- 
tional. Together, these findings suggest that SGO2 might indeed have 
an inhibitory effect on separase and that this effect requires binding 
of MAD2 but not PP2A to SGO2. 

By recruiting PP2A to (peri)centromeres, SGO2 exerts essential 
cohesin protective functions throughout meiosis I but, similar to 
securin, it is dispensable in somatic cells®’. If securin and SGO2-MAD2 
could functionally replace each other as crucial negative regulators 
of separase, then co-depletion of securin and SGO2 should result in 
detrimental deregulation of separase. First, we assessed overall effects 
oncell viability and proliferation using clonogenic assays. As expected, 
knockdown of securin or SGO2 alone had no effect or only a small 
inhibitory effect, respectively, on HeLa-K colony formation (Fig. 2a, 
Extended Data Fig. 3a). By sharp contrast, hardly any clones grew when 
both separase interactors were depleted at the same time. Similarly, 
depletion of SGO2 virtually extinguished colony formation in PTTGI” 
cells, whereas it only halved colony numbers in the parental HCT116 
cell line (Extended Data Fig. 3b). The same tendencies were found for 
Hek293T cells, although the effects were less pronounced (Extended 
Data Fig. 3c). Live imaging of histone H2B-eGFP-expressing HeLa- 
K cells revealed that the individual knockdowns did not affect mitosis; 
however, when both securin and SGO2 were missing, the ability to form 
proper metaphase plates was markedly compromised (Extended Data 
Fig. 4a). HeLa-K cells lacking securin and SGO2 were also marked by the 
otherwise uncommon absence of cohesin from early mitotic chromatin 
(Extended Data Fig. 4b). Individual knockdown of securin or SGO2 had 
no effect on cohesion, whereas simultaneous removal of both resulted 
in PCS as judged by chromosome spreads from prometaphase-arrested 
Hek293T cells (Extended Data Fig. 4c, d). The PCS phenotype was 
further fortified by chemical inhibition (using epigallocatechin- 
3-gallate (EGCG)) of PIN1, a peptidyl-prolyl-isomerase required for 
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Fig. 2|Co-depletion of securin and SGO2 is cytotoxic and results in 
premature sister chromatid separation and cohesin cleavage. a, Exemplary 
photographs of clonogenic assay with HeLa-K cells transfected with the 
indicated siRNAS. Scale bar, 1cm. b, siRNA-transfected Hek293T cells were 
supplemented with taxol and EGCG and analysed by chromosome spreading. 
Bars show mean of three independent experiments (dots). c, siRNA-transfected 
HeLa-K cells were taxol-treated and analysed by IP-immunoblotting. WCE, 
whole-cell extracts. d, Securin-depleted (siPTTGI) and separase-depleted 
(siESPL1) Hek293T cells expressing siRNA-resistant, transgenic separase 
(wild-type, AEEEL (V4), or MxxIxEE to AxxAXxAA (V5)) were treated with taxol 
and EGCG and analysed by chromosome spreading (left) and IP- 
immunoblotting (right). Bars show mean of three independent experiments 
(dots). Arrowhead marks auto-cleaved separase. 


CDK1-cyclin B1-dependent inhibition of separase” (Fig. 2b, Extended 
Data Fig. 4c, d). In keeping with the separase-SGO2 interaction being 
dependent on MAD2, partial removal of MAD2, which alone had no 
effect, inhibited colony formation and exacerbated PCS in conjunction 
with RNAi of securin (Fig. 2b, Extended Data Figs. 3, 4c, d). All of these 
phenotypes could be explained by precocious separase-dependent 
cleavage of cohesin. Indeed, consistent with a previous study in yeast”’, 
the characteristic fragments of the RAD21 subunit of cohesin were 
detected in prometaphase-arrested cells if—and only if—both securin 
and SGO2 had been depleted (Fig. 2c). Likewise, a transiently expressed 
separase activity sensor was already maximally cleaved in securin- and 
SGO2-depleted HeLa-K cells ina taxol arrest, whereas in mock-depleted 
cells the sensor was cleaved only upon addition of ZM-447439 (Extended 
Data Fig. 4e). Screening of separase mutants identified two variants (V4 
and VS) that could not interact with SGO2-MAD2 (Extended Data Fig. 5). 
When endogenous separase was replaced by these variants, depletion 
of securin was sufficient to induce PCS (Fig. 2d). This indicates that a 
direct separase inhibitory function of SGO2 works redundantly with 
securin to prevent PCS. 

Ininterphase, separase is excluded from the nucleus but has an estab- 
lished function in centriole disengagement™. Therefore, we assessed this 
licensing step of centrosome duplication rather than cleavage of chro- 
mosomal cohesin upon depletion of securin and SGO2. Premature cen- 
triole disengagement in G2-phase was increased threefold in Hek293T 
cells lacking securin and SGO2 relative to singly- or mock-depleted 
Hek293T cells (45% versus 10-15%, respectively) (Extended Data Fig. 6). 
Thus, cells lacking securin and SGO2 already contain active separasein 
interphase and, hence, are expected to lose sister chromatid cohesion 
immediately upon breakdown of the nuclear envelope. 
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Shugoshins are characterized not only by an N-terminal 
coiled-coil domain but also by a C-terminal SGO-box that binds 
to BUB1-phosphorylated histone H2A”. Phosphorylated, but not 
unphosphorylated, H2A peptide bound immobilized SGO2 (Extended 
Data Fig. 7a—c). Notably, pre-incubation with phospho-H2A but not 
with unphosphorylated H2A or a chemically similar but irrelevant 
phospho-H3 peptide suppressed the ability of SGO2 to bind MAD2 
(Extended Data Fig. 7a—d). Conversely, pre-incubation of immobi- 
lized SGO2 with MAD2 prevented subsequent interaction between 
phospho-H2A and SGO2 (Extended Data Fig. 7e, f). Phospho-H2A did 
bind to SGO2 when previous SGO2-MAD2 complex formation was not 
possible owing to the expression of SGO2(R153D) ora variant of MAD2 
lacking the C-terminal domain (MAD2AC) instead of the correspond- 
ing wild-type proteins. Thus, MAD2 and phospho-H2A bind SGO2 ina 
mutually exclusive manner. We propose that different pools of SGO2 
either associate with chromatin or bind MAD2 and inhibit separase. 


SGO is a pseudosubstrate inhibitor of separase 


Incubation of the human separase-securin complex in 
securin-degrading, anaphase-like Xenopus laevis egg extracts followed 
by affinity purification of separase generates active protease that spe- 
cifically cleaves *S-labelled RAD21”. Pre-incubation of this separase 
with in vitro-expressed SGO2 and MAD2 purified from Escherichia coli 
blocked cleavage of RAD21 (Fig. 3a). The same effect was seen when 
SGO2(N53]) was used instead of wild-type SGO2, but not when SGO2 
or MAD2 was omitted or replaced by SGO2(R153D) or MAD2AC, respec- 
tively. Thus, the PP2A-independent but MAD2-dependent inhibition of 
human separase by SGO2 can be recapitulated in vitro. 

We investigated whether shugoshin inhibits separase in the same way 
as securin—by occupying the catalytic site with anon-cleavable pseudo- 
substrate sequence®”. Using interaction-blocking antibodies, protein 
fragments and point mutations, we mapped sites within X. laevis Sgo1 
that are important for separase interaction (Extended Data Fig. 8). Nearby 
putative pseudosubstrate sites (ExxX, with @ denoting a hydrophobic 
residue, x denoting any residue and X denoting any residue except R) 
were then changed into consensus sites (PExxR) and the resulting vari- 
ants screened for cleavage by active X. laevis separase. Notably, X. laevis 
$go1(S135R) (but not Sgo1(F288R) or the wild type) was cleaved by sepa- 
rase, and this cleavage was much more pronounced in the presence of 
wild-type MAD2than in the presence of MAD2AC (Extended Data Fig. 9a). 

To confirm this finding independently, we switched back to the 
human system and tested whether a specific @ExxX-to-@ExxR muta- 
tion could also turn human SGO2 into a separase substrate. Indeed, 
35 -labelled SGO2(MI114R) (but not SGO2(F95R) or SGO2(S126R)) was 
fragmented in the presence of active separase and wild-type MAD2 
(Fig. 3b). Cleavage of SGO2(M114R)—similar to that of RAD217°—was 
further enhanced upon phosphorylation by Polo-like kinase 1. To test 
in vivo cleavage, we transfected HeLa-K cells to express Flag-tagged 
variants of SGO2 or securin. Subsequent immunoprecipitation from 
prometaphase lysates demonstrated that both securin(F118R) (positive 
control) and SGO2(M114R) were fragmented, whereas the wild-type 
proteins remained unprocessed (Fig. 3c). Cleavage of SGO2(MI114R) was 
separase-specific because it was suppressed by introduction of asecond 
mutation that compromised the interaction of SGO2 with separase 
(amino acids 239-242 of SGO2 to Ala; Fig. 3c, Extended Data Fig. 9b). 
Thus, shugoshin resembles securin in acting as a competitive inhibi- 
tor of separase but differs in that it requires MAD2 binding to doso. 

X. laevis Sgo1 and human SGO2 are very different in sequence (21% 
similarity) and length (663 versus 1,265 residues). However, the relative 
order of and distance between functional elements are the same in both 
proteins (Extended Data Fig. 9c). This suggests that MAD2 binding and 
separase inhibition evolved before duplication of a primordial SGO 
gene and were later lost randomly from one SGO gene but retained in 
the other owing to selective pressure. 
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Fig. 3 |SGO2 is aMAD2-dependent, competitive inhibitor of separase. a, 
Protease-dead (PD) or active separase was incubated with variants of 
recombinant SGO2 and MAD2andassayed for its ability to cleave *S-labelled 
RAD21. Relevant upper and lower parts of the same gel are shown. b, 
35§-labelled SGO2 variants were incubated with MAD2 and, where indicated, 
Polo-like kinase 1 (PLK1) before being assayed for in vitro cleavage by active (Ac) 


TRIP13 liberates separase from SGO2-MAD2 


In contrast to securin, human SGO2 is not (or only slowly and incom- 
pletely) degraded in late mitosis. This raises the question of how sepa- 
rase is liberated from SGO2-MAD2 when cells are ready to undergo 
anaphase. The CDC20- and MAD2-containing mitotic checkpoint 
complex (MCC) is disassembled by the combined action of the 
AAA-ATPase TRIP13 and its MAD2-specific adaptor p31°™ (ref. 7”). We 
tested whether this molecular machine could also dismantle the sepa- 
rase-SGO2-MAD2 complex. The complex was immunoprecipitated 
from securin-depleted, taxol-arrested HeLa-K cells using antibodies 
against separase and incubated with different combinations of recom- 
binant TRIP13 and p31°™* variants. Beads were then washed to remove 
detached proteins, incubated with *S-RAD21 to assay for activity of 
the immobilized separase, and finally analysed for retained proteins. 
Notably, wild-type TRIP13 and p31" quantitatively displaced SGO2 
and MAD2 from separase, thereby leaving it proteolytically active 
(Fig. 4a). Whereas TRIP13 alone partially disassembled separase- 
SGO2-MAD2, the Walker-A and -B mutant variants of TRIP13 were inac- 
tive even in the presence of p31°™". Activation of separase by TRIP13 
was also prevented by p31°°"" variants that were defective in MAD2 
or TRIP13 interaction. To assess the role of TRIP13 and p31°™* in vivo, 
we additionally transfected securin-depleted HeLa-K cells with siRNAS 
against TRIP13 and p31°™", synchronized the cells in prometaphase 
with taxol and then released them using ZM-447439. Unexpectedly, 
late mitotic events, such as the de-phosphorylation of CDC27 and 
histone H3 and the degradation of cyclin B1, were only slightly delayed 
or occurred largely on schedule in cells lacking TRIP13 and p31°™* 
relative to control-treated cells (Fig. 4b, top). By contrast, the sepa- 
rase-SGO2-MAD2 complex was markedly stabilized in the absence 
of TRIP13 and p31°™, as revealed by separase immunoprecipitation 


separase. Relevant upper and lower parts of the same gel are shown. c, SGO2- 
and securin-depleted HeLa-K cells were transfected to express the indicated 
siRNA-resistant, Flag-tagged SGO2 or securin variants, taxol-arrested and 
analysed by IP-immunoblotting. Separase-induced N- and/or C-terminal 
cleavage fragments of SGO2 and securin are labelled -N and-C. 


and immunoblotting (Fig. 4b, bottom). Thus, separase-SGO2-MAD2 
is actively dismantled by TRIP13 and p31°™" and might depend on 
this molecular machine for its disassembly in late mitosis even more 
than the MCC. 


Sister separation without APC/C activity 


The above results suggest that sister chromatid separation in the 
absence of APC/C°™ activity could be possible when separase is 
chiefly controlled by SGO2-MAD2 instead of securin. To test this 
prediction, we supplemented taxol-arrested PTTGI™ and parental 
HCT116 cells with the two APC/C inhibitors proTame and Apcin (or 
carrier solvent), released the cells by adding ZM-447439, and analysed 
them by time-resolved immunoprecipitation-immunoblotting 
and chromosome spreading. Thirty-five minutes after inhibition 
of aurora B kinase, up to 60% of chromosomes were separated in 
PTTGI™ cells but only 20% on average in parental HCT116 cells, 
despite the persistence of CDC27 phosphorylation and cyclin B1 
in both (Fig. 4c, d). Whereas levels of separase-associated securin 
also stayed constant in APC/C-inhibited HCT116 cells, SGO2 and 
MAD2 (which appeared to be overexpressed in PTTGI“ cells) disap- 
peared from separase irrespective of securin status (Fig. 4d). Con- 
sistent with SGO2-MAD2 being the primary inhibitor of separase, 
auto-cleavage of separase was strongly increased in PTTGI “ cells. 
Thus, activation of SGO2-MAD2-inhibited separase occurs at least 
partially independently of APC/C. However, sister chromatid sepa- 
ration was more effective in the absence of proTame and Apcin, 
eveninPTTGI1™ cells (Fig. 4c). Given that APC/C-associated MCC 
is disassembled also upon ubiquitylation of CDC20”%, and given 
the recent identification of separase and SGO2 as APC/C interac- 
tors or substrates”’, we tested whether separase-SGO2-MAD2 
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and autoradiograph (right; relevant upper and lower parts of the same gel are 
shown). A, TRIP13(G184A); B, TRIP13(E253Q); OF, p31°™*"(Q83A, F191A); 

PK, p31°™"*'(P228A, K229A). b, siRNA-transfected, taxol-arrested HeLa-K cells 
were released using ZM-4.47439 (ZM) and subjected to time-resolved IP- 
immunoblotting. c,d, Taxol-arrested PTTGI” and parental HCT116 cells were 
supplemented with ZM and APC/C inhibitors or carrier solvent (mock) and 
analysed by chromosome spreading (c) and time-resolved IP-immunoblotting 
(d). Bars show mean of four independent experiments (dots). Red diamonds 


MAD1-—MAD2 
at kinetochores 
(at NPCs in interphase) 


could also be dismantled by APC/C-dependent ubiquitylation 
in vitro. Indeed, incubation of the immobilized complex with E1, 
E2s (UBE2C and UBE2S) and active APC/C©° in the presence of 
ubiquitin and ATP displaced SGO2 and MAD2 and rendered separase 
proteolytically active (Extended Data Fig. 10). Dissociation was 
accompanied by ubiquitylation of separase. The complex stayed 
intact, however, when UBE2C was replaced by a dominant-negative 
variant or when APC/C“ was omitted. These results suggest that 
APC/C-dependent ubiquitylation represents a second mode of 
separase-SGO2-MAD2 disassembly. 

We propose a model in which there is bifurcated regulation of sepa- 
rase downstream of the SAC (Fig. 4e). Next to the APC/C°-securin 
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indicate experiment shown ind. e, Model of bifurcated regulation of separase. 
Activation of MAD2 by SAC signalling inhibits APC/C®°, thereby stabilizing 
securin, and enables shugoshin to directly inhibit separase. Liberation of 
separase from securin requires TRIP13—p31°™*-dependent dissociation of 
MAD2 from CDC20 followed by APC/C-dependent degradation of securin. By 
contrast, TRIP13-p31°™*-dependent dissociation of MAD2 from shugoshin 
leads to direct activation of associated separase. The alternative dissociation 
of MAD2 from its targets by APC/C°-dependent ubiquitylation and the 
CDK1-cyclin B1-dependent inhibition of separase are omitted for clarity. 


axis, asecond major branchis represented by mammalian SGO2 (Sgol 
in amphibians) which is turned into a direct, competitive inhibitor 
of separase by SAC-activated MAD2. Both branches use TRIP13- 
p31°°™* (and APC/C-dependent ubiquitylation) for disassembly of 
their respective MAD2-containing complexes. However, while this 
liberates separase from shugoshin, the canonical branch addition- 
ally requires proteasomal destruction of securin. At least in the cell 
lines tested here, securin and SGO2 can each compensate for loss 
of the other. The reason for this seeming redundancy remains to be 
clarified, but the different requirements for protein degradation 
might make it beneficial at times to rely on one or the other mode of 
separase regulation. 
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Methods 


Antibodies 

Antibodies generated within this study are from Charles River Labo- 
ratories. Antigenic peptides (Bachem) were coupled via terminal Cys 
to maleimide-activated KLH (ThermoFisher) before immunization. 
Antibodies were affinity-purified against immobilized antigens (pro- 
teins coupled to NHS-activated sepharose (GE Healthcare) or peptides 
coupled to Sulfo-link (ThermoFisher)). The following antibodies were 
used for immunoblotting according to standard protocols. Antibodies 
directed against human proteins: rabbit or guinea pig anti-SGO2 (1.5 
pg/ml; ‘ab 1’; anti-DVPPRESHSHSDQSSKC), rabbit anti-SGO2 (1 pg/ml; 
‘ab 2’; anti-KSEDLSSERTSRRRRC), guinea pig anti-TRIP13 (raised against 
full-length TRIP13), rabbit anti-p31°°"" (raised against isoform 2 of 
full-length p31°"), rabbit anti-separase””, mouse anti-securin (1:1,000; 
MBL, DCS-280), mouse anti-Flag (1:2,000; Sigma-Aldrich, M2), rabbit 
anti-SGO2 (1:1,000; Bethyl, A301-262A), rabbit anti-SGO1 (1:500, Abcam 
ab21633), mouse anti-MAD2 (1:800; Santa Cruz Biotechnology, 17D10), 
rabbit anti-MAD2 (1:1,000; Bethyl, A300-300A), mouse anti-MAD1 
(1:1,000; Sigma-Aldrich, 9B10), mouse anti-APC7 (1:800; ThermoFisher, 
PAS-20948), rabbit anti-phosphoSer10-histone H3 (H3-pS10; 1:1,000; 
Millipore, 06-570), mouse anti-PP2A-C (1:1,000; Millipore, 1D6), mouse 
anti-cyclin B1 (1:1,000; Millipore, 05-373), goat anti-CDC27°°, mouse 
anti-topoisomerase Ila (1:1,000; Enzo Life Sciences, 1C5), mouse 
anti-cyclin A2 (1:200; Santa Cruz Biotechnology, 46B11), mouse 
anti-RAD21 (1:800; Santa Cruz Biotechnology, B-2), rabbit anti-RAD21 
(1:1,000; Bethyl, A300-080A). Antibodies directed against Xenopus 
proteins: rabbit anti-Mad2 (raised against full-length protein), four 
different rabbit anti-Sgol (raised against amino acids 200-300, 300- 
400, 400-500, and 500-600 of X. laevis Sgol)’, rabbit anti-separase™. 
Other antibodies: mouse anti-Myc (hybridoma supernatant 1:50; 
DSHB, 9E10), rat anti-HA (1:2,000; Roche, 3F10), rabbit anti-ovalbumin 
(1:1,000; ThermoFisher, PA1-196), mouse anti-ubiquitinylated proteins 
(1:1,000; Millipore, FK2), mouse anti-GFP”, and mouse anti-a-tubulin 
(hybridoma supernatant 1:200; DSHB, 12G10). For immunoprecipi- 
tation experiments, the following affinity matrices and antibodies 
were used: mouse anti-Myc agarose (Sigma-Aldrich, 446), mouse 
anti-Flag M2-agarose (Sigma-Aldrich), mouse anti-RAD21 coupled to 
protein G sepharose (GE Healthcare). Rabbit anti-separase (human), 
rabbit anti-securin®’, rabbit anti-SGO2 and rabbit anti-SGO1 (human) 
were coupled to protein A sepharose (GE healthcare). To precipitate 
X. laevis separase from CSF (cytostatic factor) extract, the correspond- 
ing separase antibody or nonspecific rabbit IgG was coupled to mag- 
netic protein A Dynabeads (Invitrogen). For non-covalent coupling of 
antibodies to beads, 10 pl of the respective matrix was rotated with 2-5 
pg antibody for 90 min at room temperature and then washed three 
times with LP2 lysis buffer. For immunofluorescence microscopy (IFM), 
rabbit anti-Cap-E (antic-CAKSKAKPPKGAHVEV) and mouse anti-RAD21 
(1:500; Millipore, 05-908) were used. Isolated centrosomes were 
stained with rabbit anti-centrin-2, guinea-pig anti-C-Nap1 and mouse 
anti-y-tubulin (Sigma-Aldrich, GTU-88) as previously described. 
Secondary antibodies (all 1:500): Cy3 donkey anti-guinea pig IgG and 
Cy3 goat anti-rabbit IgG (Invitrogen), Marina-Blue goat anti-mouse lgG 
(ThermoFisher), Alexa Fluor 488 goat anti-rabbit IgG, and Alexa Fluor 
488 goat anti-mouse IgG (both Invitrogen). 


Celllines 

hTERT RPE-1 cells were purchased from ATCC (CRL-4000). Hek293 
Flp-In TRex cells were purchased from Invitrogen (R78007). All other 
cell lines were gifts: Hek293T from M. W. Kirschner, HeLa-K from 
D. Gerlich, and securin knockout and parental HCT116 from C. Lengauer. 
Validation procedures for purchased cell lines are as described by the 
corresponding manufacturers. All other cell lines were authenticated 
via visual inspection of typical morphology, immunoblotting analyses 
(for example, absence of securin), cell synchronization behaviour, 


efficiencies of different transfection reagents and resistance to certain 
antibiotics. Cell lines were not tested for mycoplasma contamination 
but microscopic inspections of their fluorescently labelled DNA con- 
tents were inconspicuous. 

All human cells were cultured in Dulbecco’s modified Eagle’s medium 
(DMEM; Biowest) supplemented with 10% fetal calf serum (FCS; 
Sigma-Aldrich) at 37 °C and 5% CO,. Generation of a stably transgenic 
Hek293-FlpIn-TRex line (Invitrogen) expressing Myc,-separase(S1126A) 
upon induction with doxycycline has been described™. For transient 
expression of Flag,-Tev,-SGO2 variants (WT, R153D, N53I, MI114R, 
RKK124-126A, LSE127-129A, HSDQ239-242A) Hek293T or HeLa-K cells 
were transfected with the corresponding pCS2-based plasmids using 
acalcium phosphate-based method or Lipofectamine 2000 (Invitro- 
gen). For time-lapse experiments, HeLa cells stably expressing histone 
H2B-eGFP were used. In addition, H2B-mCherry-SCC1,o7-743-@GFP” 
was transiently transfected to visualize premature separase activation. 


Cell treatments 

For synchronization in early S-phase, cells were treated with 2 mM 
thymidine (Sigma-Aldrich) for 20 h. Synchronization of cells in pro- 
metaphase was done by addition of taxol (LC Laboratories) to 0.2 
pg/ml 6h after release from a single thymidine block. G2 arrest was 
achieved by addition of 10 pM RO-3306 (Santa-Cruz Biotechnology) 4 
hafter G1/S release. To analyse cells in G1-phase, cells were collected 15 
h after release from a single thymidine block. For the ‘taxol-ZM over- 
ride’ experiments, taxol-arrested mitotic HeLa-K cells were collected 
by shake-off and released for the indicated times by reseeding into 
medium supplemented with ZM-447439 (5 uM, Tocris Biosciences), 
taxol (0.2 pg/ml), cycloheximide (30 pg/ml, Sigma-Aldrich) and, where 
indicated, with proTame (6 pM, Boston Biochemicals) and Apcin (20 
EM, Tocris Bioscience). To further enrich the endogenous separase- 
SGO2-MAD2 complex for later isolation, securin was depleted by RNAi. 


Immunoprecipitation 

We lysed 1 x 10’ cells with a dounce homogenizer in 1 ml LP2 lysis 
buffer (20 mM Tris-HCl pH 7.7, 100 mM NaCl, 10 mM NaF, 20 mM 
B-glycerophosphate, 5mM MgCL,, 0.1% Triton X-100, 5% glycerol), com- 
bined with benzonase (ad 30 U/I; Santa-Cruz Biotechnology), and incu- 
bated them onice for 1h. To preserve phosphorylation, lysis buffer was 
additionally supplemented with 50 nM calyculin A (LC-Laboratories) 
and1uM microcystin LR (Alexis Biochemicals). Corresponding lysates 
were centrifuged at 2,500g for 10 min followed by incubation of 1 ml 
cleared lysate with 10 ul of antibody-loaded beads for 4 hor overnight at 
4 °Cand washed 5x with LP2. Insome cases, immobilized human SGO2 
variants (purified from Gi cells) or endogenous separase-SGO2-MAD2 
complex (purified from mitotic cells) were used as starting material 
for further experiments before bound proteins were eluted by boil- 
ing in SDS-sample buffer. For consecutive immunoprecipitation from 
PTTGI™ cells, the lysate used for the first purification was kept and 
served as origin for the second precipitation. 

For immunoprecipitation of X. laevis separase, CSF-arrested Xenopus 
egg extract was prepared as previously described* and combined with 
cycloheximide (100 pg/ml), recombinant human A90-cyclin B1 (23 
ng/ul = 500 nM)” and sperm nuclei (2,000 pl”). After 15 min at room 
temperature, the egg extract was released into anaphase II by addition 
of CaCl, (0.6 mM). Previously prepared mock- or X. laevis separase 
antibody-coupled magnetic beads were equilibrated in CSF-XB fol- 
lowed by addition of anaphase extract (minimal volume of 500 ul) and 
consecutive incubation for 45 min at 18 °C. After re-isolation, beads 
were washed 5x in CSF-XB supplemented with 300 mM NaCl and 0.01% 
Triton X-100. 


RNA interference 


For efficient knockdown, cells were calcium phosphate or 
RNAiMax (Invitrogen) transfected with 70-100 nM siRNA duplex 


of PTTG1: 5’-UCUUAGUGCUUCAGAGUUUGUGUGUAU-3’; SGO2: 5’-GAA 
CACAUUUCUUCGCCUATT-3’; MAD2: 5’-GAGUCGGGACCACAGUUUA 
UU-3’; MADI: 5’- AACCAGCGGCUCAAGGAGGUU-3’; P31": 5’-GGCU 
GCUGUCAGUUUACUUTT-3’; TRIP13:5’-CUGAUGAAGUGUCAGAUCA-3’; 
TRP: 5’-GGGUGAAGAUAGUAAUGAAUCTT-3’. Transfected cells were 
grown for 12-24 h before synchronization procedures were applied. 
Luciferase siRNA (GL2) was used as negative control (Ctrl). 


Immunofluorescence staining and microscopy 

Hela-K cells transfected with the indicated siRNAs were grown on 
poly-lysine coated glass coverslips and processed 8 h after thymidine 
release in the presence of BI-2536 (10 nM) to slow down prophase in 
early mitotic cells. To remove soluble proteins, cells were pre-extracted 
(PBS, 0.3% Triton X-100) for 5 min, washed once with PBS and fixed 
with fixation solution (PBS, 3.7% formaldehyde, 0.3% Triton X-100) for 
10 min at room temperature. Subsequently, coverslips were washed 
twice with quenching solution (PBS, 100 mM glycine), incubated with 
permeabilization solution (PBS, 0.5% Triton X-100) for 5 min, washed 
once with PBS and then incubated in blocking solution (PBS, 1% (w/v) 
BSA) for 1h at room temperature. Coverslips were transferred into a 
wet chamber and incubated with primary antibodies for 1 h followed 
by four washes with PBS-Tx (PBS, 0.1% Triton X-100). After incubation 
with fluorescently labelled secondary antibodies for 40 min, samples 
were washed once, stained for 10 min with 1 pg/ml Hoechst 33342 in 
PBS-Tx and washed again four times. Finally, coverslips were mounted 
in 20 mM Tris-HCl pH 8.0, 2.33% (w/v) 1,4-diazabicyclo(2.2.2)octane, 
78% glycerol ona glass slide. IFM of fixed cells was performed ona DMI 
6000 inverted microscope (Leica) using a HCX PL APO 100x/1.40-0.70 
oil objective. To identify early mitotic nuclei, DNA morphology (com- 
mencing condensation) and condensin staining intensity were exam- 
ined. For representative images, Z-stack series over 4 um in 0.2-um 
increments were collected, deconvoluted and projected into one plane 
using the LAS-AF software. Chromosome spreads were prepared using 
Canoy’s solution as described®. Spreads were observed with the Zeiss 
Axioplan 2 Imaging microscope using a Plan-APOCHROMAT 100*/1.40 
Oil objective. A cell was counted as suffering from PCS when exhibit- 
ing loss of cohesion of >50% of its chromosomes. At least 100 spreads 
were counted per condition. To assess centriole engagement status, 
centrosomes were isolated from RO-3306 arrested HeLa-K cells (4 x 10°) 
36 h after transfection of indicated siRNAs and stained as previously 
described*. 


Clonogenic assays 

Twelve hours after siRNA transfection, HeLa-K, Hek293T, HCT116 
parental or PTTGI™ cells were seeded in 10-cm dishes (100 cells per 
plate and condition). Cells were grown for 10 days and then fixed in 
ice-cold methanol for 10 min. Staining was performed as described*® 
except for the use of ethanol instead of glutaraldehyde. The number 
of colonies per plate with a minimal area of 20 (circularity 0.00-1.00) 
was determined with Image] particle analysis software. 


In vitro disassembly of separase-SGO2-MAD2 complex 

Ten microlitres of immobilized separase-SGO2-MAD2 complex was 
incubated with 2 mg/ml recombinant variants of TRIP13 (WT; A, Walker 
A mutant G184A; B, Walker B mutant E253Q) and RGS-His,-p31°™* 
(WT; QF, Q83A/F191A MAD2-binding deficient; PK, P228A/K229A 
TRIP13-binding deficient) in EDTA/EGTA-free lysis buffer supplemented 
with ATP (1 mM) ina final volume of 30 pl for 30 min at 18 °C. Subse- 
quently, 15 pl of the corresponding supernatant was removed for later 
analysis. Beads were washed three times with LP2 and then equilibrated 
in cleavage buffer (10 mM Hepes-KOH pH 7.7, 50 mM NaCl, 25 mM NaF, 
1mMEGTA, 20% glycerol). To monitor separase activity, 2 pl of in vitro 
translated *®S-RAD21-GFP was added to a volume of 30 ul. Following 
incubation for 30 min at 30 °C, reactions were stopped by boiling in 
SDS-sample buffer. Assaying disassembly by in vitro ubiquitylation 


(Extended Data Fig. 10) was performed essentially as described® with 
the exception that instead of securin, 10 pl immobilized separase— 
SGO2-MAD2 complex was added as substrate. 


Bacterially expressed proteins 

pET28-vector encoded, His,-SUMO,-tagged” variants of TRIP13 
(NP_004228) and p31°™* (NP_055443) were expressed individually in 
E. coliRosetta 2 DE3 (Novagen). Bacteria were lysed in LP1 (PBS, 5 mM 
imidazole, 0.5 mM DTT and an additional 400 mM NaCl) and purified 
over Ni”*-NTA-agarose (Qiagen) according to standard procedures. 
Following elution with PBS supplemented with 250 mM imidazole, 0.5 
mM DTT and an additional 400 mM NaCl (pH adjusted to 7.5 with HCI), 
proteins were dialysed at 4 °C against LP1in presence of His,-SENP2 (10 
ng per 100 pg of protein) and then rotated for 3 h over 0.9x the amount 
of fresh Ni?*-NTA-agarose. Supernatants containing pure TRIP13 and 
p31°°™* were dialysed against 50 mM Hepes-KOH pH 7.7, 10% glycerol, 
100 mM NaCl, 5 mM MgCl,, 1 mM EDTA and ImM DTT. Human MAD2 
and X. laevis Mad2 were purified as described previously’. 


H2A-peptide binding assay 

Thr-9-phosphorylated and unmodified histone H2A-peptides (QAVLLP- 
KKTESHHKAKGK) were obtained from Bachem; Ser-10-phosphorylated 
or unmodified histone H3-peptides were purchased from Eurogentec 
(AS-61702 and AS-64611). One milligram of each of the H2A peptides 
was conjugated to 10 mg/ml maleimide-activated ovalbumin (Ther- 
moFisher) according to the manufacturer’s instructions and dialysed 
against CSF-XB* containing 0.5 mM DTT. Reactions (final volume 
of 30 pl) containing 10 pl immobilized FLAG,-Tev,-SGO2 and 1 pg 
ovalbumin-coupled-H2A peptide, 5 pg free H2A or H3 peptide, or 4 
pg of recombinant MAD2 were assembled and incubated for 30 min at 
18 °C. Unbound peptide or protein was removed by four washes with 
CSF-XB and the second reaction was assembled with the correspond- 
ing counterpart and again incubated for 30 min at 18 °C. Samples were 
again washed (four times with CSF-XB containing 0.01% Triton X-100) 
before beads were eluted by boiling in SDS-sample buffer. 


Mapping experiments in X. laevis Sgo1 

N-terminally tagged Myc,- or FLAG,-tagged X. laevis Sgol variants were 
in vitro translated in rabbit reticulocyte lysate (TNT Quick, Promega) 
according to the manufacturer’s protocol. For mapping with the help 
of interaction-blocking antibodies, 16 1 Sgol was combined with 2.5 pg 
anti-X. laevis Sgol and 5 wg recombinant X. laevis Mad2 and incubated 
for 30 min at 18 °C. Then, 10 pl of magnetic beads loaded with X. laevis 
separase (isolated from anaphase egg extract) was added and reactions 
were incubated for 30 min at 18 °C. Beads were washed four times with 
CSF-XB, 0.01% Triton X-100 and twice with CSF-XB, 0.01% Triton X-100, 
300 mM NaCland finally eluted by boiling in SDS-sample buffer. In all 
other cases 12.5 pl Sgol and 5 pg Mad2 were pre-incubated in 300 ul 
CSF-XB, 0.01% Triton X-100 followed by addition of 10 pl immobilized 
separase. 


Separase inhibition and cleavage assays 

For separase inhibition, 10 pl of FLAG,-Tev,-SGO2 bound to anti-FLAG 
beads and 4 pg recombinant MAD2 were incubated in cleavage buffer 
(total volume of 28 pl) for 10 min at 18 °C followed by the addition of 
2.5 pl active (Ac, S$1126A variant) or protease dead (PD, $1126A and 
C2029S) separase”. After a 10-min incubation at room temperature, 
2 plin vitro translated, *S-labelled RAD21-GFP were added. Reactions 
were stopped after 30 min by boiling in SDS-sample buffer. Samples 
were separated by SDS-PAGE and blotted onto PVDF membrane 
(SERVA), which was cut and analysed by immunoblotting before reas- 
sembly and autoradiography. For Fig. 3b, 2 pl of each in vitro trans- 
lated, *S-labelled SGO2 variant was combined with 1 pg human MAD2 
and, where indicated, 0.1 pg PLK1 (ProQinase; No. 0183-0000-1) in 
modified cleavage buffer (10 mM Hepes-KOH pH 7.7, 50 mM NaCl, 25 
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mM NaF, 20% glycerol, 1mM ATP, 10 mM MgCl) and a total volume of 
15 pl. After 15 min at 30 °C, 2.5 pl separase (Ac or PD) was added and 
reactions were incubated for 30 min at room temperature. For Extended 
Data Fig. 9a, 2 pl of each in vitro translated, *°S-labelled X. laevis Sgo1 
variant was combined with 1 pg X. laevis Mad2 (WT or AC10) and 10 pl 
immobilized X. laevis separase. Samples were further processed and 
analysed as described above. 


Live-cell imaging 

Cells in phenol red-free medium were seeded into p-slide 8-well cham- 
bered coverslips (Ibidi) and kept in an atmosphere of 37 °C and 5% 
humidified CO, during microscopy ona DMI 6000 inverted microscope 
(Leica). For imaging of unperturbed mitosis, GFP and DIC images were 
captured 6 h after release from thymidine arrest at 10-min intervals 
over a period of 15h, through a HCX PL APO 40x/0.85 CORR objective. 
Changes in focus plane due to mitotic rounding of the cells were com- 
pensated by collecting Z-stacks at each time point. Captured images 
from each experiment were analysed using the corresponding LAS-AF 
software (Leica). 


Statistics and reproducibility 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized. For quantitative analyses of chro- 
mosome spreads, clonogenic assays, and IFM specimen the investi- 
gators were blinded to sample allocation. Experiments analysed by 
immunoblotting were repeated 2-4 times with similar results (2-4 
biological replicates). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 
All source data for this study are available online. 
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Extended Data Fig. 1| Mammalian SGO2 interacts with separase. a, Hek293T 
cells were co-transfected with expression vectors for the following mouse 
proteins: GFP-separase, Flag-SGO2 and securin. Immunoprecipitation was 
carried out from transfected cells with either anti-Flag or anti-GFP antibodies 
and analysed by immunoblotting with the indicated antibodies. b, Mouse 
separase does not interact with human SGO2 and, therefore, cannot be used to 
study separase regulation in human cells. Securin-depleted Hek293T cells 
expressing GFP-tagged human separase(S1126A) or mouse separase(S1121A) 
were taxol-arrested and then subjected to IP-immunoblotting analyses using 
the indicated antibodies. c, SGO2 and MAD2 interact specifically with separase 
in untransformed cells. Taxol-arrested RPE1 cells were subjected to IP- 


=== extracts 


immunoblotting analyses as indicated. d, Eveninsecurin-expressing cells, a 
considerable fraction of separase is sequestered by SGO2. Taxol-arrested 
Hek293T, HCT116, and HeLa-K cells were subjected to IP-immunoblotting 
analyses using the indicated antibodies. ab 1, anti: DVPPRESHSHSDQSSKC 
(corresponding to amino acids 230-245 of human SGO2); ab 

2, anti-KSEDLSSERTSRRRRC (corresponding to amino acids 1,234-1,249 of 
human SGO2); rb, rabbit; mo, mouse. Given beloware the relative intensities 

(in per cent) of the separase signals (sum of full-length and N-terminal 
auto-cleavage fragment). Note the considerable co-depletion of separase upon 
SGO2immunoprecipitation from HeLa-K (right). 
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separase in Xenopus. The indicated variants of in vitro-expressed X. laevis detected by immunoblotting (middle and right). R170A, Mad2-binding- 
(X.L) shugoshins and an excess of E.-coli-expressed Mad2 (to mimic SAC deficient Sgol; AC, Sgol-binding-deficient, C-terminally truncated Mad2. 
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parental HCT116 cells (b), and Hek293T cells (c) transfected with the indicated siRNAs. Bars show percentages of colony numbers relative to the control of three 
independent experiments (dots). 
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Extended Data Fig. 4| Depletion of securin and SGO2, but not the 
individual knockdowns, results in impaired chromosome alignment, 
premature loss of chromosomal cohesin and unscheduled separase 
activity. a, HeLa-K cells transfected with the indicated siRNAs anda histone 
H2B-eGFP expression plasmid were observed by video fluorescence 
microscopy to assess metaphase plate formation (bottom). Bars show mean of 
three independent experiments (dots) counting at least 50 mitotic cells each 
(top). Scale bar, 5 um. b, Eight hours after release from thymidine arrest, HeLa-K 
cells transfected with the indicated siRNAs were pre-extracted, fixed, and 
examined by fluorescence microscopy for cohesin-negative early mitotic 
chromatin. Left, representative images; right, bars show mean of three 
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independent experiments (dots) counting prophase nuclei that were still 
round but already stained positive for condensin (100 each). Scale bar, 5 m.c, 
Hek293T cells transfected with the indicated siRNAs were supplemented with 
taxol (but not EGCG; compare Fig. 2b) and analysed by chromosome spreading. 
Bars show mean of three independent experiments (dots). d, Exemplary 
immunoblots of cells analysed inc and Fig. 2b. Star denotes nonspecific band. 
e, siRNA-transfected HeLa-K cells expressing a separase activity sensor 
(H2B-mCherry-RAD21!°’ 76°-eGFP) were released froma taxol arrest by 
addition of ZM-447439 at time zero and analysed by time-resolved 
immunoblotting. 
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Extended Data Fig. 5 | Identification of SGO2-binding-deficient separase 
variants. Taxol-arrested Hek293T cells expressing transgenic Myc-tagged 
wild-type separase (WT) or one of the indicated variants (V1-V5) were analysed 
by IP-immunoblotting using the indicated antibodies. The investigation of 


variants: changes: amino acid positions: 
V1 (ABBA) FxVFxE to AxAAxA 1362-7 
V2 (non-cleavable): ExxR to RxxE 1483-6, 1503-6, 1532-5 
V3 (PP2A-1): A55 1419-73 
V4 (PP2A-2): AEEEL 1490-3 
V5 (PP2A-3): MxxIxEE to AxxAxAA 1485-91 

A1419-73 
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PP2A <=> Separase <=> Sgo2 «> Mad2 
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PP2A-binding-deficient variants was motivated by the notion that SGO2- 
MAD2, like PP2A, preferentially interacts with full-length rather than 
auto-cleaved separase’® (see, for example, Fig. 3c). V3 cannot bind PP2A but can 
bind SGO2-MAD2, indicating overlapping but not identical binding sites. 
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Extended Data Fig. 6| Depletion of securin and Sgo2 but not the individual 
knock-downs result in premature disengagement of centrioles. HeLa- 
K cells transfected with the indicated siRNAs were released froma thymidine 


block and arrested in G2 phase with the CDK1 inhibitor RO-3306. 


Corresponding lysates were used for immunoblotting (bottom right) and 
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centrosome isolation followed by immunofluorescence microscopy (top right, 
representative images) to assess the degree of centriole disengagement as 
revealed by two C-Nap1 foci. Left, bars show mean of three independent 
experiments (dots) counting 100 centrosomes each. Scale bar, 11m. 
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phosphorylated histone H2A blocks subsequent MAD2 binding. a, 
Experimental scheme for experiments shown in b-d. Beads loaded with the 
indicated Flag-SGO2 variants were consecutively incubated first with 
phosphorylated or unphosphorylated H2A or H3 peptide (input 1) and then 
with wild-type or, where indicated, C-terminally truncated (AC) MAD2 (input 2). 
Following washing (supernatant 2), bound proteins were visualized by 
immunoblotting. b, Usage of free H2A peptides. c, Usage of ovalbumin-coupled 


H2A peptides to facilitate their detection by standard glycine-SDS-PAGE and 
immunoblotting. d, Phosphorylated histone H3 peptide does not bind to 
SGO2. Free H2A and H3 peptides, used to interrogate immobilized SGO2, were 
separated by Tricine-SDS-PAGE” and analysed by Coomassie staining and 
immunoblotting. e, Pre-charging of SGO2 with MAD2 blocks subsequent 
phospho-H2A binding. Experimental scheme (top) and corresponding 
immunoblotting analysis (bottom). H2A peptides were coupled to ovalbumin 
for ease of detection. f, Cartoon illustrating that SGO2 can bind to H2A 
phosphorylated on Thr121 or to MAD2 but not both at the same time. 
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Extended Data Fig. 8 |Mapping separase-binding sites on X. laevis Sgol.a, 
Covering amino acids 200-300 of X. laevis Sgol with polyclonal antibodies 


impairs its binding to separase. Region-specific polyclonal X. laevis Sgol 


antibodies A—-D were characterized (left) and added toin vitro-expressed X. 


laevis Sgoland E.-coli-expressed MAD2 (prey) as indicated. Immobilized X. 


laevis separase isolated by immunoprecipitation from anaphase egg extracts 


was then addedas bait to these mixtures. Separase beads were washed and 


finally probed for associated proteins by immunoblotting. Mock, unspecific 
IgG; asterisk, unspecific band. b,c, Identification of two sites within X. laevis 
Sgol that are relevant for separase binding. Different in vitro translated (IVT) X. 


laevis Sgol fragments and variants thereof were combined with wild-type 


N-terminal xSgo1 fragments 


0 50 100 150 200 250 Separase | Mad2 

predicted coiled coil MIM binding: | binding: 
1-179 ++ n.d. 
2 1-207 ++ n.d. 
5 99 - 207 ++ ++ 
& 107 - 179 + n.d. 
2 121 - 200 : nd. 
= 139 - 240 - n.d. 
= 139 - 270 ++ ++ 
99 - 270 ++ n.d. 
| MIM | LRrrL-99-103-AArrA ++ n.d. 
2 LCS-143-145-AAA Eoomm - ++ 
S RLE-189-191-AAA MiM| §j ++ n.d. 
* NK-248-249-AA MIM | + ++ 
< LE-264-265-AA MIM | ++ n.d. 
RRR-254-256-AAA MIM | 7 ++ 


MAD2 or MAD2-AC as indicated (prey). Separase beads as ina were combined 
with these mixtures, washed and analysed for associated proteins by 
immunoblotting. d-f, X. laevis Sgo1(143-145A) and Sgo1(254-256A) (red) show 
compromised separase binding but retain MAD2 binding. d, The indicated 
full-length X. laevis Sgol variants were assessed for MAD2-dependent separase 
binding as inb,c.e, Theindicated X. laevis Sgol fragments and variants thereof 
were combined with wild-type MAD2 or MAD2-AC, immunoprecipitated via 
their Myc-tag and assessed for MAD2 binding by immunoblotting. f, Summary 
of the mapping experiments. MIM, Mad2-interaction motif; n.d., not 
determined. 
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Extended Data Fig. 9 | X. laevis Sgo1 and human SGO2 share the same order 
and spacing of separase- and MAD2-binding sites. a, A point mutationturns 
X. laevis Sgol into a Mad2-dependent separase substrate. *S-labelled X. laevis 
Sgol variants were incubated with wild-type Mad2 or Mad2-AC before being 
assayed for in vitro-cleavage by X. laevis separase, which had been 
immunoprecipitated from anaphase egg extracts. Gels were blotted onto 
membranes, which were cut and subjected to immunoblotting (top) before 
reassembly and autoradiography (bottom). b, Human (H.s.) SGO2(127-129A) 


and SGO2(239-242A) show compromised separase binding but retain MAD2 
binding. The indicated full-length SGO2 variants were immunoprecipitated via 
their Flag-tags from transfected, taxol-arrested Hek293T cells and assessed by 
immunoblotting for binding of MAD2 and separase. See also illustration at 
bottom. c, Cartoon comparing the arrangement of functional domains and 
alignment of pseudo-substrate and Mad2-binding sites of X. laevis Sgoland 
human SGO2. The separase interaction site around position 144 of X. laevis 
Sgoland around position 128 of human SGO2 was omitted for clarity. 
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Product No. M8069; Sigma); guinea pig anti-TRIP13 (1.5 ug/ml; raised by Charles River Laboratories against full length Trip13), 
rabbit anti-p31comet (2ug/ml; raised by Charles River Laboratories against isoform 2 of full length p31comet), mouse anti-APC7 
1:800; Product No. PA5-20948; Thermo Scientific), rabbit anti-phosphoSer10-histone H3 (1:1,000; Product No. 06-570; Lot No. 
2370127; Millipore), mouse anti-PP2A-C (1:1,000; clone 1D6; Product No. 05-421; Millipore), mouse anti-cyclin B1 (1:1,000; 
Product No. 05-373; Lot. No. 2199734; Millipore), goat anti-Cdc27 (1:1000) see Ref. in Material and Methods, mouse anti- 
topoisomerase Ila (1:1,000; clone 1C5; Product No. ADI-KAM-CC210-E; Enzo Life Sciences), mouse anti-cyclin A2 (1:200; clone 
46B11; Product No. sc-53234; Santa Cruz Biotechnology), mouse anti-Rad21 (1:800; clone B-2; Product No. sc-271601; Santa 
Cruz Biotechnology), rabbit anti-Rad21 (1:1,000; Product No. A300-080A; Bethyl). Antibodies directed against Xenopus proteins: 
Four different rabbit anti-Sgo1 (all 1 ug/ml used in WB; raised against amino acids 200-300, 300-400, 400-500, and 500-600 of 
X.]. Sgo1) see Ref. in Material and Methods, rabbit anti-separase (1:1000) see Ref. in Material and Methods, rabbit anti-Mad2. 
Other antibodies: Mouse anti-Myc (hybridoma supernatant 1:50; DSHB, 9E10), rat anti-HA (1:2,000; clone 3F10; Product No. 
11867423001; Roche), rabbit anti-ovalbumin (1:1,000; Product No. PA1-196; Thermo Fisher Scientific), mouse anti- 
ubiquitinylated proteins (1:1,000; clone FK2; Product No. 04-263; Lot No. 3117294; Millipore), mouse anti-GFP (hybridoma 
supernatant 1:2,000; gift from D. van Essen and S. Saccani), and mouse anti-a-tubulin (hybridoma supernatant 1:200; DSHB; 
clone 12G10). 


Validation Antibodies are validated using multiple methods (e. g. ICC/IF, WB and IP). To address antibody specificity we used the 
corresponding recombinantly expressed antigen as positive- and RNAi-mediated depletion from human cells as negative 
controls. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) 


Authentication 


Mycoplasma contamination 


Commonly misidentified lines 
(See ICLAC register) 


hTERT RPE-1 cells were purchased from ATCC (CRL-4000). Hek293 Flp-In TRex cells were purchased from Invitrogen (R78007). 


All other cell lines were gifts: Hek293T from Marc W. Kirschner (Harvard Medical School), HeLa-K from Daniel Gerlich (IMBA), 
SECURIN knock-out and parental HCT116 from Christoph Lengauer (Johns Hopkins). 


Validation procedures for purchased cell lines are described by the corresponding manufacturers. All other cell lines were 
authenticated via visual inspection of typical morphology, by immunoblotting analyses (e.g. absence of securin), cell 


synchronization behavior, efficiencies of different transfection reagents and resistance to certain antibiotics. 


Cell lines were not tested for mycoplasma contamination but microscopic inspections of their fluorescently labeled DNA 
contents were inconspicuous. 


No commonly misidentified cell lines were used. 
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® Check for updates 


Prolonged mitosis often results in apoptosis’. Shortened mitosis causes tumorigenic 
aneuploidy, but it is unclear whether it also activates the apoptotic machinery?. 
Separase, a cysteine protease and trigger of all eukaryotic anaphases, hasa 
caspase-like catalytic domain but has not previously been associated with cell death“. 
Here we show that human cells that enter mitosis with already active separase rapidly 
undergo death in mitosis owing to direct cleavage of anti-apoptotic MCL1 and BCL-XL 
by separase. Cleavage not only prevents MCL1 and BCL-XL from sequestering 
pro-apoptotic BAK, but also converts them into active promoters of death in mitosis. 
Our data strongly suggest that the deadliest cleavage fragment, the C-terminal half of 


MCLI, forms BAK/BAX-like pores in the mitochondrial outer membrane. MCL1 and 
BCL-XL are turned into separase substrates only upon phosphorylation by NEK2A. 
Early mitotic degradation of this kinase is therefore crucial for preventing apoptosis 
upon scheduled activation of separase in metaphase. Speeding up mitosis by 
abrogation of the spindle assembly checkpoint results in a temporal overlap of the 
enzymatic activities of NEK2A and separase and consequently in cell death. We 
propose that NEK2A and separase jointly check on spindle assembly checkpoint 
integrity and eliminate cells that are prone to chromosome missegregation owing to 
accelerated progression through early mitosis. 


The intrinsic pathway of apoptosis is regulated by a balance between 
pro- and anti-apoptotic BCL2 family proteins that are hallmarked by 
presence of one to four BCL2 homology (BH) domains*®. Pore formation 
by homo-oligomerization of BAK and BAX leads to mitochondrial outer 
membrane permeabilization (MOMP) and release of cytochromec and 
other apoptogenic factors from the intermembrane space. MOMP 
is counteracted by family members such as BCL2 itself, BCL2-like 1 
(BCL-XL) and myeloid cell leukaemia 1 (MCL1). These proteins use a 
hydrophobic groove formed by their BH1-3 domains to sequester the 
BH3 domain of BAK/BAX and inhibit their self-interaction. BH3-only 
proteins suchas BIM or BAD activate BAK/BAX either directly by tran- 
sient interaction or indirectly by forcing BAK/BAX off anti-apoptotic 
BCL2 members through competition®. Intrinsic apoptosis in response 
to excessive cellular stress, suchas DNA damage, is initiated by activa- 
tion of BH3-only proteins, typically via upregulated transcription*’. 
Other triggers of intrinsic apoptosis are less well understood. Likewise, 
it remains unclear whether proteins other than BAK/BAX might also 
be able to form pores and contribute to MOMP. 

Separase is the essential trigger protease of all eukaryotic anaphases*. 
Once activated in metaphase, it opens the DNA-embracing cohesin 
ring complex by cleavage of the kleisin subunit, thus resolving sister 
chromatid cohesion and enabling chromosome segregation. Separase 
contains a C-terminal caspase-like proteolytic domain’, but it has not 
been functionally linked to apoptosis. For most of the cell cycle, spindle 
assembly checkpoint (SAC) signalling ensures that human separase is 
held inactive by association with securin, SGO2-MAD2 or CDK1-cyclin 
B15”. In response to improper attachment of kinetochores to spindle 


microtubules, the SAC delays activation of separase and other late 
mitotic events, thereby giving the cell time for error correction”. 
Hyperstimulation of the SAC by spindle toxins such as taxol (paclitaxel) 
prolongs mitosis”. Conversely, SAC impairment results in shortened 
mitosis, chromosomal instability and tumorigenesis”. While prolonged 
mitosis is known to result in death in mitosis (DiM) or mitotic slippage 
followed by apoptosis in interphase’, it remains unstudied whether 
shortened mitosis might also trigger intrinsic apoptosis. 


DiM upon premature separase activation 

Both depletion of the cohesion-protecting factors SGO1 or sororinand 
derepression of separase (by co-depletion of SGO2 and securin) result in 
premature sister chromatid separation’”””” (Extended Data Fig. 1a). This 
was followed by prolonged mitotic arrest of cells lacking SGO1 or sororin, 
as previously described”. However, cells lacking SGO2 and securin 
exhibited DiM as judged by cleavage of poly(ADP-ribose) polymerase 
(PARP) and fluorogenic caspase reporters (Extended Data Fig. 1b-h). 
DiM in cells depleted of SGO2 and securin was specific because it was 
suppressed by concomitant knockdown of separase (Extended Data 
Fig. 1g, h). Thus, premature activity of separase rather than premature 
sister chromatid separation represents an apoptotic stimulus. 


MCL1and BCL-XL are separase substrates 


We speculated that MCL1 and BCL-XL could be relevant targets of sepa- 
rase in DiM because both (1) have previously been linked to apoptosis 
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Fig. 1| The pro-survival factors MCL1 and BCL-XL are separase substrates. 
a, b, Autoradiographies of in vitro cleavage assays in the presence of active 
NEK2A and ATP. WT, wild type; HFHF, His,-Flag-His,-Flag tag; ATM, 
transmembrane domain deleted. c, siRNA-transfected, taxol-arrested 
Hek293T cells were treated with Q-VD-OPh or mock-treated and analysed by 
immunoblotting. siPT7TGI, securin depletion. d, immunoblots of parental (—/-) 
and MCL1™" heterozygous (-/+) hTERT RPE1 cells after (+) or without (-) 
rapamycin-induced TEV protease complementation. H3-pS10, 
Ser10-phosphorylated histone H3 (mitotic marker). 


after prolonged mitotic arrest ”; (2) have been reported to be cleaved 
by caspases'®”; and (3) contain an ExxR motif that matches the con- 
sensus cleavage site of separase close to the caspase cleavage site(s). 
Indeed, human BCL-XL and MCL1 were cleaved in vitro not only by 
caspase 3 but also by separase, and the resulting fragments were clearly 
distinguishable in size (Fig. 1a, b). Replacing 31-ExxR-34 in BCL-XL and 
173-ExxR-176 in MCL1 with tobacco etch virus (TEV)-protease cleav- 
age sites required only a few amino acid exchanges (Extended Data 
Fig. 2a). This rendered both survival factors resistant to separase but 
susceptible to TEV protease, while leaving cleavage by caspase 3 unaf- 
fected (Fig. 1a, b). The proteolytic fragments that were generated by 
either separase or TEV protease exhibited identical mobilities in SDS- 
PAGE, thereby indicating that separase cleaves BCL-XL after Arg34 and 
MCL1after Arg176. Various in vitro-expressed fragments were used as 
length standards to confirm the location of the cleavage site for MCL1 
(Extended Data Fig. 2b). In mice, the ExxR motif is conserved in BCL-XL 
but has been replaced with DxxRin MCL1. Still, mouse MCL1 was readily 
cleaved by separase in vitro (Extended Data Fig. 2c). BCL-XLand MCL1 
were also cleaved in human Hek293T, HeLa-K, HCT116, hTERT RPEland 
mouse NIH/3T3 cells during DiM triggered by RNA interference (RNAi) 
using small interfering RNAs (siRNAs) against SGO2 and PTTGI1 (which 
encodes securin) (Extended Data Fig. 2d-g; see also below). Notably, 
these in vivo cleavages were mediated by separase rather than caspase 
because (1) Q-VD-OPh, a pan-specific caspase inhibitor, blocked cleav- 
age of PARP but not of BCL-XL and MCL1 (Fig. 1c); (2) fragmentation was 
not detectable when the endogenous proteins were replaced by their 
separase-resistant but caspase-sensitive TEV variants (Extended Data 
Fig. 2d); and (3) MCL1 fragments from Hek293T cells lacking SGO2 and 
securin perfectly co-migrated with in vitro-expressed fragments com- 
prising amino acids 1-176 (MCL1-N) and 177-350 (MCL1-C) (Extended 
Data Fig. 2f). In fact, we never observed caspase-dependent processing 
of MCL1 or BCL-XL in vivo, although this might be due to our focus on 
early stages of DiM. Using CRISPR-Cas9 gene editing in hTERT RPE1 
cells, we replaced the separase cleavage site with a TEV protease cleav- 
age site in one allele of MCL1. Upon activation of ‘split TEV protease’ by 
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Fig. 2 | Cleavage of MCL1 by separase enables BH3-only proteins to liberate 
BAK. a, Immunoblots of (TEV protease supplemented, +) lysates and Flag 
immunoprecipitation of transfected, mitotic Hek293T cells expressing the 
indicated, C-terminally His,-Flag-His,—Flag (HFHF)-tagged MCL1 fragments 
together with HA-tagged BAK and BIM. b, Model of how separase inactivates 
MCLI. 


rapamycin-induced complementation, half of endogenous MCL1 was 
cleaved, as expected (Fig. 1d). This was accompanied by considerable 
PARP cleavage, but only when the cells were in mitosis. Thus, MCL1 
cleavage at position 176 is sufficient to initiate DiM during a prometa- 
phase arrest. 


Cleavage of MCL1 and BCL-XL liberates BAK 


We investigated whether separase-dependent cleavage affected the 
interactions of MCL1 and BCL-XL with other BCL2 family members. To 
this end, we used Flag tags to affinity-purify MCL1 and the correspond- 
ing C-terminal separase cleavage fragment from transfected Hek293T 
cells and analysed them for association with co-expressed BAK relative 
to the BH3-only protein BIM. Whereas MCLI preferentially interacted 
with BAK, as expected, MCL1-C bound only to BIM (Fig. 2a). An analo- 
gous experiment was conducted with BCL-XL but using BAD instead 
of BIM owing to the different binding preference of its C-terminal frag- 
ment (Extended Data Fig. 3a). Removal of the N-terminal 34 amino acids 
switched BCL-XL from binding BAK to binding BAD (Extended Data 
Fig. 3b). For MCL1, the exchange of binding partners upon cleavage 
was additionally recapitulated by addition of TEV protease to cell lysate 
containing the TEV variant of MCL1 (Fig. 2a). The BH4 domain of BCL-2 
contributes directly to BAX binding”°. Similarly, MCL1-N co-purified 
with BAK from transfected Hek293T cells (Extended Data Fig. 3c), show- 
ing that MCL1 contacts BAK not only via its hydrophobic groove but 
also via its N-terminal domain. We propose that cleavage by separase 
abolishes the cooperativity of binding, thereby enabling BH3-only 
proteins suchas BIM to supersede BAK from the C-terminal fragment 
of MCLI (Fig. 2b). Liberated BAK would then lead to MOMP. 


MCLI1-N and MCL1-C actively promote apoptosis 


We made the counter-intuitive observation that DiM induced by siRNAs 
against SGO2and PTTGI1 was alleviated by co-depletion of MCLI (Fig. 3a, 
b) and almost fully rescued by co-depletion of both MCL1 and BCL-XL 
(Extended Data Fig. 3d, e). While this showed that these two BCL2 family 
proteins are the crucial—if not the only—substrates of separase during 
DiM, it also suggested that the separase cleavage fragments of MCL1 
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Fig. 3 | Both separase cleavage fragments of MCL1are pro-apoptotic and 
MCL1-C forms macropores during late stages of DiM. a, b, HeLa-K cells 
transfected with the indicated siRNAs and expression plasmids were released 
from early S-phase arrest, supplemented with the DNA stain SiR-Hoechst anda 
fluorogenic caspase-3/7 reporter and analysed by immunoblotting (a;10h 
after release) and live cell imaging (b; cell fate profiles). c, Immunoblots of 


and BCL-XL are apoptogenic. Consistent with this hypothesis, the miti- 
gation of PARP cleavage by depletion of BCL-XL or MCL1 (Extended 
Data Fig. 2d, lanes 4-6) was reversed by expression of siRNA-resistant 
transgenic BCL-XL or MCL1 but not their separase-resistant TEV vari- 
ants (lanes 7-10). Given the stronger effects of MCL1 depletion, we 
further studied its cleavage fragments individually. Full-length MCL1, 
MCL1-N or MCL1-C was expressed in Hek293T cells in which endogenous 
MCL1 (but not SGO2 or securin) was depleted by RNAi. Short-term 
absence of MCL1 caused very little apoptosis on its own but led to 
some cell death in conjunction with chemical inhibition of BCL-XL 
by WEHI-539 (Fig. 3c, lanes 5 and 6). This phenotype was rescued by 
transgenic full-length MCLI (lane 9). Notably, expression of MCL1-N 
or MCLI-C resulted in PARP cleavage even in the absence of WEHI- 
539, with the C-terminal fragment having the stronger effect (lanes 
11 and 14). PARP cleavage correlated with annexin V and propidium 
iodide staining as additional markers for apoptosis (Extended Data 
Fig. 3f, g). Induction of apoptosis by MCL1-N or MCL1-C did not occur 
ininterphase but only in taxol-treated cultures (Fig. 3c, Extended Data 
Fig. 3f). Time-lapse microscopy further illustrated that HeLa-K cells 
expressing MCLI-N or MCL1-C also underwent DiM in the absence of 
spindle toxin (Extended Data Fig. 3h, i). Thus, separase-dependent 
cleavage not only extinguishes the pro-survival activity of MCL1 but 
also creates two fragments, each of which kills cells upon entry into 
mitosis without requiring prolonged mitotic arrest. Investigation of 
MCLI-N-induced apoptosis suggested that it promotes both libera- 
tion of BAK and separase-dependent cleavage of BCL-XL by a positive 
feedback mechanism (Extended Data Fig. 4). 


MCL1-C kills by mitosis-specific MOMP 

Interaction analyses of epitope-tagged forms of MCLI revealed that 
MCLI-C exhibited homotypic interactions, whereas uncleaved, 
full-length MCL1 could associate neither with itself nor with MCL1-C 
(Extended Data Fig. 5a). Self-interaction of MCLI-C and its ability to 
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MCL1- or control-depleted HeLa-K cells expressing transgenic full-length MCL1 
or fragments thereof as indicated. d, Immunofluorescence 2D SIM of SGO2- 
and securin-depleted Hek293T cells undergoing DiM. Note the absence of the 
mitochondrial outer membrane marker TOM20 fromthe centres of MCL1 
rings. Scale bars, 0.5 um. 


induce PARP cleavage were blocked by deletion of the transmembrane 
domain or presence of the BH3-mimicking MCL1-inhibitor A-1210477, 
which blocked not only binding of BAK to full-length MCL1 but also 
binding of BIM to MCLI1-C (Extended Data Fig. 5b-d). Tandem affin- 
ity purification of co-expressed, differently tagged MCL1-C further 
revealed that homo-oligomerization and BIM binding are mutually 
exclusive (Extended Data Fig. 5d), which suggests that BIMinteracts with 
MCLI-C only transiently and disengages upon MCL1-C self-interaction. 
These observations are reminiscent of the requirements for BAK/ 
BAX-dependent MOMP”!”’ and, thus, are consistent with pore forma- 
tion by MCL1-C. Notably, homotypic MCL1-C interactions occurred 
only in extracts from mitotic, and not interphase, cells (Extended 
Data Fig. 5b); this is consistent with the finding that the pro-apoptotic 
effect of MCL1-C is cell-cycle-dependent. According to our hypoth- 
esis, Separase-induced DiM should be delayed or diminished, but still 
occur, in the absence of BAK and BAX, whereas MCL1-C-induced DiM 
should be independent of them. Using time-resolved fractionation of 
chromatin and organelles, including mitochondria, from the cytosol 
followed by immunoblotting analyses, we compared parental HCT116 
and BAKI‘’ BAX? double-knockout cells as they went synchronously 
from G2- through M-phase. Whereas the kinetics of MCL1-cleavage 
were indistinguishable in the absence of SGO2 and securin, the release 
of cytochrome cinto the cytosol and PARP cleavage were delayed, but 
still occurred, in the absence of BAK and BAX (Extended Data Fig. 6a, 
b). In fact, PARP cleavage and accumulation of Ser139-phosphorylated 
histone H2A-X (yH2AX) in the absence of BAK and BAX were less affected 
during DiM induced by siRNAs against SGO2 and PTTGI than during 
staurosporine-induced apoptosis” (Extended Data Fig. 6c). Notably, 
the timing and extent of cytochrome c release and PARP cleavage were 
the same in both cell lines when DiM was induced by MCL1-C expression 
(Extended Data Fig. 6d). 

At later stages of intrinsic apoptosis, the mitochondrial network 
fragments into globular structures, and BAK and BAX form large rings 
and macropores within the outer membrane®’. Immunofluorescence 


microscopy of MCL1 and the MOM protein TOM20 in taxol-arrested 
Hek293T cells showed that MCL1 and TOM20 colocalized during DiM, 
and this colocalization increased when the network dissolved into 
spheres (Extended Data Fig. 7a, b). Notably, 2D structural illumination 
microscopy (SIM) revealed the formation of large, typically 0.3-m 
rings by MCL1-C in SGO2- and securin-depleted cells undergoing DiM 
(Fig. 3d, Extended Data Fig. 7c). TOM20 appeared largely absent from 
the centre of the rings, suggesting that they represent macropores. 

Phosphorylation might explain why the MOMP activity of MCL1-Cis 
specific to mitosis. A candidate approach identified Thr301. Changing 
this residue to phosphorylation-mimicking Glu enabled MCL1-C to 
induce apoptosis also in interphase, whereas changing Thr301 to Ala 
abrogated the pro-apoptotic and self-interaction properties of MCL1-C 
(Extended Data Figs. 5d, 7d, e). The detection of affinity-purified 
MCLI-C bya phospho-Thr-specific antibody was limited to mitosis and 
abolished by the Thr301Ala mutation, suggesting that this position is 
phosphorylated in vivo (Extended Data Fig. 7d)—possibly by auroraB 
(Extended Data Fig. 7f, g). Existing structural information places Thr301 
of MCL1 at the end of a-helix 6, and dimerization of the correspond- 
ing helix has been reported to be involved in homo-oligomerization 
of BAK”, Sequence alignment implies that the Thr301-equivalent 
position is occupied by a Gluin BAX (Extended Data Fig. 7h). When we 
changed this constitutively negatively charged residue at position 
146 to Ala, the pro-apoptotic function of BAX was abrogated, whereas 
changing it to Thr rendered BAX a largely mitosis-specific effector of 
cell death (Extended Data Fig. 7i). A salt bridge at the end of a-helix 6 
might therefore be required for pore formation (Extended Data Fig. 7j) 
and explain why MCLI1-C becomes pro-apoptotic only upon phospho- 
rylation of Thr301 during mitosis, whereas MOMP by wild-type BAX is 
independent of the cell cycle. 


Importance of phosphorylation by NEK2A 


In unperturbed cells, MCL1 and BCL-XL are present at the onset of ana- 
phase. The question arises of why then cells do not die when separase 
becomes active on schedule. Considering that cleavage of meiotic 
kleisin by separase requires its phosphorylation”’, we tested the effect 
of various kinases when first establishing MCL1 and BCL-XL cleavage 
by separase in vitro (Extended Data Fig. 8). These analyses revealed 
that cleavage of MCL1 by separase essentially requires NEK2A, while 
cleavage of BCL-XL was enabled by NEK2A and (less so) CDK1/2-cyc- 
lin A2 (note that NEK2A was included in Fig. 1a, b and Extended Data 
Figs. 2b, c, 4c). NEK2A and CDK1/2-cyclin A are special among mitotic 
kinases and APC/C substrates in that they are degraded early—that is, 
ata time when separase activation is still blocked by SAC signalling”. 

To identify phosphorylation sites within MCL1 and BCL-XL that are 
relevant for cleavage, we changed candidate serine and threonine 
residues to alanine or phosphorylation-mimicking acidic residues 
and screened corresponding variants in kinase and/or cleavage assays. 
These analyses revealed the following (Extended Data Fig. 9): (1) the 
NEK2A-dependent phosphorylation of Ser60 and Thr163 is essential for 
separase-dependent cleavage of MCLI, and phosphorylation of Ser159 
further improves it. (2) NEK2A and CDK1/2-cyclin A2 phosphorylate 
Ser4 and Ser164, and Ser62, respectively, of BCL-XL to enable its cleav- 
age by separase. (3) The phosphorylation-mimicking variants MCLI(S/ 
T60,159,163D/Q) and BCL-XL(S4,62,164D) are cleaved by separase in 
the absence of kinases. 

The above findings suggested that separase does not trigger DiM 
at anaphase onset merely because NEK2A (and cyclin A2) is absent by 
then and MCL1 and BCL-XL—owing to dephosphorylation—no longer 
represent separase substrates (Fig. 4a, top). As a corollary, any tempo- 
ral overlap between the enzymatic activities of NEK2A and separase 
should cause DiM. Indeed, this explains why constitutive activity 
of separase causes early mitotic cell death and why knockdown of 
NEK2A largely suppressed DiM induced by siRNAs against SGO2 and 


PTTG1 (Extended Data Fig. 1g, h). Extending the window of NEK2A 
activity until separase activation in anaphase should also cause 
DiM (Fig. 4a, middle). Live cell imaging of transfected HeLa-K cells 
revealed that overexpression of wild-type NEK2A was compatible 
with normal mitosis, whereas production of a C-terminally trun- 
cated, stabilized variant (AMR)” triggered DiM, typically shortly after 
anaphase onset (Extended Data Figs. 3i, 10a). This was confirmed by 
time-resolved immunoprecipitation and western analysis of cells 
synchronously undergoing late mitosis. NEK2A-AMR-expressing 
and mock-treated populations both degraded cyclin Bland securin 
and lost SGO2 from separase with similar kinetics (Extended Data 
Fig. 10b). However, only in NEK2A-AMR-containing cells did activa- 
tion of separase coincide with cleavage of MCL1and PARP. Consistent 
with MCL1 and BCL-XL being the relevant targets, apoptosis in ana- 
phase was also triggered by expression of the constitutive separase 
substrates BCL-XL(S4,62,164D) or MCL1(S/T-60,159,163D/Q) instead 
of NEK2A-AMR (Extended Data Fig. 10c, d). Thus, NEK2A must be 
degraded in early mitosis to prevent separase from killing cells in 
anaphase. 


Separase-induced DiM increases with MCL1 


To investigate whether DiM in response to stabilization of NEK2A 
was graded with MCL1 dosage, we transfected siRNA or plasmids into 
Hek293T cells to express wild-type NEK2A or the AMR variant and, 
simultaneously, to reduce or increase the amount of MCL1. When 
transgenic NEK2A was wild-type and, hence, degraded upon entry 
into mitosis, or when NEK2A-AMR expression was combined with 
MCL1 depletion, cells showed no signs of apoptosis (Fig. 4b, Extended 
Data Fig. 1la—c). As seen before, NEK2A-AMR induced some DiM at 
endogenous levels of MCL1. However, annexin V staining and PARP 
cleavage increased with increasing levels of (transgene-encoded) 
MCL1and cleavage fragments thereof. These data suggest that phar- 
macological inhibition of early mitotic NEK2A degradation should 
preferentially kill MCL1-overexpressing cells, which are a hallmark 
of many cancers”. 


A minimal duration of mitosis checkpoint 


The SAC is active in each M-phase and chiefly determines its duration”®. 
Abrogation of the SAC results in chromosomal instability owing to 
accelerated progression through mitosis”. We investigated whether, 
under these conditions, separase might become active when NEK2A 
has not yet been fully degraded (Fig. 4a, bottom). When MAD2 and 
the SAC kinase BUBRI were depleted by RNAi, HeLa-K cells that had 
been released from a thymidine arrest degraded securin and cyclin 
Blearlier than control cells; this correlated with earlier auto-cleavage 
of separase, as expected (Fig. 4c). At the same time, degradation of 
NEK2A and the disappearance of a corresponding MCL1-S60 phos- 
phorylation mark were delayed, which we attribute to competition 
by other substrates for the APC/C (Fig. 4c, Extended Data Fig. 11d). 
Notably, this was accompanied by cleavage of MCL1, BCL-XL and PARP 
and appearance of a sub-G1 peak in flow cytometry, which is another 
hallmark of apoptotic cells. As seen before, these phenotypes were 
largely suppressed by co-depletion of MCL1 and BCL-XL and fully sup- 
pressed by additional expression of separase-resistant TEV variants 
of MCL1 and BCL-XL, but re-installed by transfection with the corre- 
sponding wild-type transgenes. Cleavage of MCL1, BCL-XL and PARP 
also occurred upon individual depletion of MAD2 or BUBR1, albeit to 
lesser extent (Extended Data Fig. le). It also occurred in both mouse 
and human cells upon chemical inhibition of the SAC kinase MPS1 
with reversine (Fig. 4d, Extended Data Fig. lle). Thus, in mammalian 
cells SAC abrogation suffices to induce cleavage of MCL1 and BCL-XL 
by separase and consequent DiM. A corollary is that the few existing 
SAC-deficient tumour cell lines (unless slowed in mitotic progression 
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DNA content 


Fig. 4| DiM due to simultaneous activity of NEK2A and separase is graded 
by MCLI level and triggered by SAC deficiency. a, Both prolonged activity of 
NEK2A (middle) and speeded-up mitosis due to SAC deficiency (bottom) cause 
DiM. b, Fluorescence microscopic quantification of annexin V- and propidium 
iodide (PI)-positive Hek293T cells transfected with siMCL1 and expression 
plasmids for GFP-MCL1and NEK2A-WT or NEK2A-AMR, as indicated, and 
released for 10-12 h from thymidine arrest (thy. rel.). Expression of MCLland 
degree of PARP cleavage were quantified by densitometry of immunoblots 
(Extended Data Fig. 11b). c, HeLa-K cells transfected with the indicated siRNAs 


by SAC-independent means**) should have found a way to avoid cleav- 
age of MCL1 and BCL-XL. We immunoprecipitated MCL1 from four can- 
cer cell lines: partially SAC-compromised SW480, SAC deficient T47D 
and A427 and, asacontrol, SAC-proficient A549 cells"”*>°, Notably, only 
MCL1 from A549 cells was cleaved upon incubation with NEK2A and 
separase (Fig. 4e). It is unclear how MCL1 from the other three cell lines 
is rendered separase-resistant, but it is not due to the mere absence of 
p53 (Extended Data Fig. 11f). Cancer-associated MCLI1 variants seem to 
be rare. However, of the few catalogued in the Catalogue of Somatic 
Mutations in Cancer (COSMIC) database”, deletion of Glu171 is by 
far the most abundant one, being identified ten times in six studies. 
Althoughit leaves the ExxR motif intact, unexpectedly, this mutation 
renders MCL1 resistant to separase (Fig. 4f). 


Conclusion 

In most cases studied, the intrinsic pathway of apoptosis is triggered by 
transcriptional upregulation of BH3-only protein expression”. Here, 
we describe a mechanism of DiM, which is probably conserved in ver- 
tebrates (Extended Data Fig. 12) and consists of separase-dependent 
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and expression plasmids were released from thymidine arrest and analysed 
by immunoblotting and propidium iodide staining with flow cytometry 

8-12 hthereafter. siBUBIA, BUBR1 depletion; siBCL2L1, BCL-XL depletion. 

d, Time-resolved immunoblots of NIH/3T3 cells transfected with indicated 
siRNAs, pre-synchronized with thymidine, and treated with Q-VD-OPh and/or 
reversine, undergoing mitosis. Asterisk, nonspecific band. e, Immunoblots of 
lysates and NEK2A/separase-treated Mcllimmunoprecipitations from the 
indicated cell lines. f, Autoradiograph of in vitro-expressed, NEK2A/ 
separase-treated *°S-MCL1-WT and *°S-MCL1-AE171. 


cleavage of MCL1 and BCL-XL and their concurrent transformation 
from pro-survival into mitosis-specific pro-apoptotic factors. Our 
results strongly suggest that the deadliest fragment, MCL1-C, per- 
meabilizes the mitochondrial outer membrane by forming pores. 
Because separase-dependent cleavage of BCL-XL is also sufficient 
for DiM, the same might be true of BCL-XL-C. Degradation of MCL1 
was causally linked to apoptosis upon mitotic arrest, but exactly how 
MCL1is removed remains unknown”. Although we have not done 
so here, it will therefore be interesting to investigate whether MCL1 
somehow becomes phosphorylated and separase-activated during 
prolonged mitosis. Here we studied shortened mitosis. Our results 
show that abrogation of SAC results in DiM owing to simultaneous 
activity of NEK2A and separase (Fig. 4a, bottom). We propose that 
NEK2A and separase form a surveillance mechanism that eliminates 
SAC-deficient cells that would otherwise be doomed to massive chro- 
mosomal instability and aneuploidy, thereby ensuring the survival of 
cells with the correct length of M-phase and protecting the organism 
from tumorigenesis. This ‘minimal duration of early mitosis check- 
point’ (DMC) might explain why mutational inactivation of SAC genes 
in cancer is rare®?®, 
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Methods 


Antibodies 

Antibodies generated within this study were raised and purified as 
described’. The following antibodies were used for immunoblotting 
and immunofluorescence microscopy (IFM): Rabbit anti-separase”’, 
rabbit anti-sororin”, rabbit anti-PINI“, goat anti-CDC27”, mouse 
anti-GFP”, mouse anti-Flag (1:2,000; Sigma-Aldrich, M2), rabbit 
anti-SGO2 (1:1,000; Bethyl, A301-262A), rabbit or guinea pig anti-SGO2 
(1.5 pg/ml; anti: DVPPRESHSHSDQSSKC), rabbit anti-SGO1 (1:500; 
Abcam, ab21633), mouse anti-securin (1:1,000; MBL, DCS-280), rab- 
bit anti-phosphoSer10-histone H3 (‘H3-pS10’; 1:1,000; Millipore, 
06-570), mouse anti-cyclin B1 (1:1,000; Millipore, 05-373), rabbit 
anti-cleaved caspase 3 (Asp175; 1:1,000; Cell Signaling, 5A1E), rabbit 
anti-BCL-XL (1:1,000; Cell Signaling, 2762), mouse anti-MCL1 (1:800; 
BioLegend, W16014A), guinea-pig anti-MCL1 (0.75 pg/ml; for IFM 
1.5 pg/ml; raised against amino acids 1-327 (ATM) of human MCL1I), 
guinea-pig anti-phosphoSerl0-MCL1 (‘MCL1-pS60’; 0.5 pg/ml; for 
IFM 1 pg/ml; anti-CVIGGpSAGA, liberated from reactivity towards 
CVIGGSAGA), anti-TOM20 (1:500; Santa-Cruz Biotechnology, F10), 
mouse anti-cytochromec (1:1,000; BD Pharmingen, 7H8.2C12), mouse 
anti-Bubr1 (1:1,000; BD Transduction Laboratories, clone 9), rabbit 
anti-BAX (1:1,000; Abcam, ab32503), rabbit anti-BAK (1:1,000; Abcam, 
ab32371), rabbit anti-MAD2 (1:1,000; Bethyl, A300-300A), rabbit 
anti-PARP (1:800; Cell Signaling, 46D11), mouse anti-PARP (1:1000; 
Calbiochem, AM30), anti- MBP monoclonal (1:1,000; NEB Biolabs, 
E8038S, HRP-conjugated), mouse anti-NEK2 (1:600; BD Transduc- 
tion Laboratories, clone 20), mouse anti-RGS-—His, (1:1,000; Qiagen 
34610), rabbit anti-phosphoSer139-histone H2A.X (‘YH2AX’; 1:5,000; 
Millipore, EP854(2)Y), mouse anti-topoisomerase Ila (1:1,000; Enzo Life 
Sciences, 1C5), mouse anti-cyclin A2 (1:200; Santa Cruz Biotechnology, 
46B11), rat anti-HA (1:2,000; Roche, 3F10) and mouse anti-a-tubulin 
(hybridoma supernatant 1:200; DSHB, 12G10). Nonspecific rabbit, 
mouse and guinea pig IgGs were from Sigma-Aldrich. For immunopre- 
cipitation experiments, the following affinity matrices and antibodies 
were used: mouse anti-Flag agarose (Sigma-Aldrich, M2), rat anti-HA 
agarose (Roche, clone 3F10), anti-GFP nanobody covalently coupled 
to NHS-agarose (GE Healthcare), mouse anti-RGS—His, or guinea 
pig anti-MCL1 coupled to protein G sepharose (GE Healthcare) and 
rabbit anti-BCL-XL coupled to protein A sepharose (GE Healthcare). 
For non-covalent coupling of antibodies, 10 pl of the respective matrix 
was rotated with 2-5 pg antibody in the presence of 1% w/v BSA (Roth) 
for 90 min at room temperature and then washed three times. Sec- 
ondary antibodies for immunoblotting were horseradish peroxidase 
(HRP)-conjugated goat anti-rabbit, anti-mouse and anti-guinea pig 
IgGs (Sigma-Aldrich, all used at 1:20,000). The following secondary 
antibodies were used for IFM (all 1:500): Cy3 donkey anti-guinea pig 
IgG, Cy5 goat anti-mouse IgG (both Jackson Immunoresearch Labo- 
ratories), Alexa Fluor 488 goat anti-mouse IgG (Invitrogen). Antisera 
against mouse SGO2 and mouse securin were gifts from A. M. Pendas. 


Cell lines and plasmids 

HeLa-K, Hek293T, hTERT RPE-1, HCT116, HCT116 BAK“, BAX”, HCT116 
TP53-, SW480, and NIH/3T3 cells were cultured in Dulbecco’s modified 
Eagle’s medium (DMEM) (Biowest), T-47D cells in RPMI-1640 (Biowest) 
and A427 and A549 cells in essential minimum Eagle’s medium (EMEM) 
(Roth). All media were supplemented with 10% heat-inactivated fetal 
calf serum (FCS) (Sigma-Aldrich). Cells were cultured at 37 °C in 5% 
CO,. Xenopus laevis S3 cells were grown at 27 °C under atmospheric 
CO, in 70% Leibovitz’s L-15 medium (Gibco) supplemented with 1% 
Glutamax (Gibco) and 10% heat-inactivated FCS. MCL1 (NM_021960) 
was PCR-cloned from human testis cDNA (Clontech), BCL-XL (BCL2L1, 
NM _138578), BAK1 (BCL2L7,NM_001188), BJM (BCL2L11, NM_001204106) 
and BAD (BCL2L8, NM_032989) were PCR-cloned from self-made HeLa 
cell cDNA. MCL1 from X. laevis (NP_001131055), Xenopus tropicalis 


(XP_002935512) and Danio rerio (NP_571674) were PCR-cloned from 
self-made oocyte cDNA, zygote cDNA (a gift from C. Niehrs) and 72-h 
embryos (a gift from P. Braaker), respectively. Details about the result- 
ing plasmids and derivatives thereof are available upon request. The 
ZipGFP plasmid was a gift from X. Shu (Addgene plasmid #81241)**. 
For transient expression of proteins, Hek293T, HeLa-K or hTERT RPE-1 
cells were transfected with corresponding pCS2- or pcDNAS-based 
plasmids using a calcium phosphate-based method or Lipofectamine 
2000 (Invitrogen) according to the manufacturer’s protocol. X. laevis 
S3 cells were transfected using PEI (Polysciences; 3 pl of lug/pl perl pg 
DNA). MCLI’“™" hTERT RPE1 cells were generated using the co-CRISPR 
approach* with 0.8 1M ouabain octahydrate (Sigma) 72 h after trans- 
fection with 30 pmol ssODN MCL1"" (5’-GCTGGAGTTGGTCGGGGAATC 
TGGTAATAACACCAGTACGGACGGGTCACTACCCTCGACGCCGCCGCC 
AGCAGAGGAGGAGGAGGACGAGAACT TGTACT TCCAGTCGCTCGAGA 
TTATCTCTCGGTACCTTCGGGAGC-3’), 10 pmol ssODN ATPIA1 and 
lpg eSpCas9_ATPIA1 _G3_Dual_sgRNA (a gift from Y. Doyon, Addgene 
plasmid #86613) encoding the corresponding sgRNA for MCL1™” 
(5’- CGAGTTGTACCGGCAGTCGC-3’). Ouabain-resistant clones 
were screened by PCR with oligonucleotides (fwd: 5’-GAGTTC 
GCTGGCGCCACCCCGTAGGACT-3’, rev: 5’-GGGAGTGAGGCCTTGG 
CGATTAATGAACCCCCTT-3’) and the resulting 871-bp fragment was 
further analysed for the presence of a Xhol site. 


Cell treatments 

For synchronization in early S-phase, human and frog cells were treated 
with 2mM thymidine (Sigma-Aldrich) for 20 h. Upon release/wash-out, 
cells entered mitosis within 8-10 h. Synchronization of cells in prometa- 
phase was done by addition of taxol (LC Laboratories) to 0.2 pg/ml 6h 
after release from thymidine block. G2 arrest was achieved by addition 
of 10 uM RO-3306 (Santa-Cruz Biotechnology) 4 h after release from 
thymidine arrest for 6-10 h. For release from G2 arrest, cells were trypsi- 
nized, washed 5x with fresh medium and reseeded for the indicated 
incubation times. To inhibit MCL1, BCL-XL, caspases or MPSI kinase, cells 
were supplemented with A-1210477 (2.5 uM, Abcam), WEHI-539 (0.5 uM, 
Cayman Chemicals), Q-VD-OPh (20 uM, BD Pharmingen) or reversine (lor 
5uM, Cayman Chemicals), respectively, at the time of taxol addition and 
incubated for 6h. To assess apoptosis in interphase, cells were arrested 
with thymidine for 20 h together with simultaneous transfection of cor- 
responding plasmids or siRNA, released for 15 h, re-supplemented with 
thymidine and analysed 10 h thereafter. To assess apoptosis in mitosis, 
cells were transfected 10-12 h before thymidine addition, released and 
treated with taxol as described above, and collected when morphological 
signs of apoptosis first became visible (typically 10-13 h after thymidine 
wash-out). When studying depletion of MCL1 and BCL-XL without sepa- 
rase deregulation, the taxol arrest was prolonged to 12 h before cells 
were analysed. As control, cells were treated with 1 pM staurosporine 
(Abcam) for 8-12 h. To address DiM in SAC-abrogated HeLa-K cells, the 
corresponding plasmids were transfected first. Ten hours later, siRNA 
was transfected. Thymidine was added 14 h thereafter. (In the case of 
NIH/3T3 cells, thymidine (4 mM) was added immediately after siRNA 
transfection and washed away 18 h thereafter. This was followed by 
a second thymidine block 10 h later.) After 20 h, cells were released, 
re-seeded, supplemented with reversine (5 1M unless stated otherwise) 
and incubated for the indicated times without addition of taxol. For the 
‘taxol-ZM override’ experiments, taxol-arrested HeLa-K cells were col- 
lected by shake-off and released for the indicated times by replating into 
medium supplemented with ZM 447439 (5 uM, Tocris Biosciences), taxol 
(0.2 pg/ml) and cycloheximide (30 pg/ml, Sigma-Aldrich). To induce 
FRB-FKBP heterodimerization of the split TEV, rapamycin (100 nM, 
Sigma) was added 10 h before cells were collected. 


Immunoprecipitation and subcellular fractionation 


Cells (1 x 10’) were lysed with a dounce homogenizer in 1 ml LP2 lysis 
buffer (20 mM Tris-HCl (pH 7.7), 100 mM NaCl, 10 mM NaF, 20 mM 


B-glycerophosphate, 5mM MgCl,, 0.1% Triton X-100, 5% glycerol), sup- 
plemented with benzonase (30 U/I; Santa Cruz) and complete protease 
inhibitor cocktail (Roche), and incubated on ice for 1h. To preserve 
phosphorylations, lysis buffer was additionally supplemented with 
calyculin A (SO nM, LC-Laboratories) and microcystin LR (1 uM, Alexis 
Biochemicals). If transmembrane proteins were to be analysed, the 
corresponding lysis reactions were cleared by low-speed centrifuga- 
tion (2,500g for 10 min), giving rise to whole-cell extracts (WCE). Inall 
other cases, lysis reactions were cleared by centrifugation at 16,000g 
for 30 min, resulting in lysates. For immunoprecipitations, 1 ml of WCE 
or lysate was rotated over 10 pl of antibody-carrying beads for 4-12 hat 
4 °Cand washed 5x with lysis buffer. For Extended Data Figs. 4c and 8c, 
immobilized MCL1 or BCL-XL-BAK/-BAD complexes were incubated 
with separase before boiling in SDS-sample buffer. For Fig. 4e, incuba- 
tion also included NEK2A and ATP. For cleavage of MCL1-TEVin lysate 
(Fig. 2a), 20 U of His,-TEV protease was added at 18 °C and immuno- 
precipitation was started 30 min thereafter. Intact mitochondria were 
enriched and separated from cytosol as described””. 


RNA interference 

For efficient knockdown, cells were transfected with calcium phosphate 
or RNAiMax (Invitrogen) using 70-100 nM siRNA duplex directed against 
PTTGI1 (SECURIN): 5’-UCUUAGUGCUUCAGAGUUUGUGUGUAU-3’, 
SGO2: 5’-GAACACAUUUCUUCGCCUATT-3’, ESPL1 (SEPARASE): 
5’-AACUGUUCUACCUCCAAGGUUAGAUUU-3’, NEK2A: 3’-UUCUGAG 
AGUCAGCUCACA-S’, MCLI1: 5’-CGAAGGAAGUAUCGAAUUUTT-3’, 
BCL2L1 (BCL-XL): 5’-CCAGGGAGCUUGAAAGUUUTT-3’, SGO1: 
5’-GAUGACAGCUCCAGAAAUUTT-3’, CDCAS (SORORIN): 3’- UGGAG 
GAGCUCGAGACGGA-S’, BUBIB (BUBR1): 5’°-GGACACATT TAGATG 
CACTTT-3’ and/or MAD2: 5’-GCTTGTAACTACTGATCTTTT-3’. For trans- 
fer of siRNA into NIH/3T3 cells, we used Lipofectamine 2000 (Invitro- 
gen) according to an optimized protocol supplied by the manufacturer. 
The following pre-designed siRNAs (IDT) were used: mm.RI.MCL1.13.1: 
5’-GAGUGCUGACUAGAUGAUCAACUAUUCAUCUAGUC:3’, mm.RI. 
MCL1.13.2: 5’-GCGUAAACCAAGAAAGCUUCGAUGAAGCUUUCUUGG-3’ 
(a mixture of both was used for Fig. 4d), mm.RI.PTTG1.13.1 (‘a’ in 
Extended Data Fig. 2g): 5’-UAUCUUUGUUGAUAAGGAUUCAUU 
AUCCUUAUCAAC-3’, mm.RI.PTTG1.13.2 (‘b’ in Extended Data 
Fig. 2g): 5’-AUCACCGAGAAGUCUACUGUGUCUUAGUAGACUUCU-3’, 
mm.RI.SGOL2a.13.1 (‘a’ in Extended Data Fig. 2g): 5’-ACCUCU 
UCAGUAUCAAGAAAGGUUGUCUUGAUACUG-3’and mm.RI.S 
GOL2a.13.2 (‘b’ in Extended Data Fig. 2g): 5’-GAAACUUAGAC 
AAAAGGUUCGAUUUACCUUUUGUCU-3’. Luciferase siRNA (GL2) served 
as control (Ctrl). If depletion of the corresponding protein caused DiM, 
siRNA transfection was performed during thymidine block and cells 
were collected upon entering mitosis. In all other cases transfection 
was performed on asynchronous cells 12 h before synchronization. 


Fluorescence microscopy 

To detect apoptosis in fixed samples, Hek293T cells were grown on 
poly-lysine-coated glass coverslips and processed 10-12 hafter release 
from thymidine arrest. Different staining and fixation procedures were 
carried out. Where indicated, cells were stained with annexin V-FITC 
and propidium iodide according to the manufacturer’s protocol 
(Annexin V-FITC Apoptosis Detection Kit, Abcam), washed once with 
corresponding binding buffer and mounted onto coverslips in 5 pl 
DAPI-Fix (1x MMR, 48% glycerol, 11% formaldehyde, 1 mg/ml Hoechst 
33342)*”. To label intact mitochondria, 200 nM MitoTracker (Orange 
CMTMRos, Invitrogen) was added 10 h after release from thymidine 
to Hek293T cells in serum-free cell culture medium and incubated 
for 45 min before cells were fixed. If additional antibody staining was 
performed, corresponding coverslips were washed once with PBS, fixed 
with fixation solution (PBS, 3.7% formaldehyde) for 15 min at room 
temperature, and then washed twice with quenching solution (PBS, 100 
mM glycine). Cells were then treated with permeabilization solution 


(PBS, 0.5% Triton X-100) for 5 min, washed with PBS and incubated in 
blocking solution (PBS, 1% (w/v) BSA) overnight at 4 °C. Coverslips were 
transferred into a wet chamber, incubated with primary antibodies for 
1h, washed four times with PBS-Tx (PBS, 0.1% Triton X-100), incubated 
with fluorescently labelled secondary antibodies for 1h, washed once 
with PBS-Tx, stained for 10 min with 1 pg/ml Hoechst 33342 in PBS-Tx, 
washed four times and mounted in 20 mM Tris-HCl (pH 8.0), 2.33% (w/v) 
1,4-diazabicyclo(2.2.2)octane, 78% glycerol ona glass slide. Immuno- 
fluorescence microscopy of fixed cells was performed ona DMI6000 
inverted microscope (Leica) using a HCX PL APO 100x/1.40-0.70 oil 
objective. Z-stack series were collected in 0.2-m increments over 10 
pm, deconvoluted (blind algorithm) and, where indicated, projected 
into one plane using the LAS-AF software. For 2D SIM, cells were grown 
on Precision cover glasses (Marienfeld) and imaged with the Nikon 
Eclipse Ti2 using aSR APO TIRF AC 100*H objective. Z-stacks were cap- 
tured in 0.1-~m increments over 2 1m, processed using the stack recon- 
struction mode and visualized by volume and maximum projection 
of the NIS-Elements AR software (Nikon). Chromosome spreads were 
prepared as described. For video microscopy, transfected HeLa-K or 
X. laevis S3 cells were released from single 20 h thymidine block (how 
long?), transferred into a p-Slide 8-well (Ibidi) dish, and imaged starting 
5-6h later over a period of 5-15 hin 8-10-min intervals ona DMI 6000 
inverted microscope (Leica) using a HCX PL APO 40x/0.85 CORR (HeLa) 
or HCX PL FLUOTAR L 20x/0.40 CORR PHI (S3) objective and the cor- 
responding LAS AF600 software. To visualize chromatin, SiR-Hoechst 
(200 nM)* and Verapamil (1 1M, both from Spirochrome) were added 
to the culture medium 6h before imaging. Where indicated, medium 
was additionally supplemented with IncuCyte Caspase 3/7 reagent (5 
uM, Essen Bioscience)”. 


In vitro kinase and cleavage assay 

MCL1 from different origins served as substrates for in vitro phospho- 
rylation reactions. Immunoprecipitated MCL1 bound to 10 pl beads, 3 pl 
of in vitro translated MCLIATM (TNT Quick-coupled Transcription and 
Translation kit, reticulocyte lysate-based, Promega) or 2 pg bacterially 
expressed, purified MCLIATM were combined with ‘cold’ ATP (5 uM) and 
40 Ci y-“P-ATP (Hartmann Analytic) for radioactive labelling or only 
with cold ATP (1mM) for non-radioactive phosphorylation. Reactions of 
25 yl were assembled in kinase buffer (10 mM Hepes-KOH (pH 7.7), 50 mM 
NaCl, 25 mM NaF, 1mM EGTA, 20% glycerol, 10 mM MgCl, 10 mM DTT) 
including 1 pl of NEK2A-AMR, NEK2B-AMR or A86-cyclinA2-CDK1/2 
(corresponding to 12.5 x 10° transfected Hek293T cells each; see below), 
0.4 ug of His-tagged A90-cyclinBI-CDK1", 0.1 pg of PLK1 (ProQinase, 
No. 0183-0000-1) or 0.1 1g of GST-tagged Aurora B (PRECISIO-Kinase, 
A2108). To specifically inhibit phosphorylation, the following kinase 
inhibitors were used: RO-3306 (2 uM, Santa-Cruz), BI-2536 (100 nM, 
Boehringer-Ingelheim), staurosporine (300 nM, Abcam) and ZM-447439 
(0.5 uM, Tocris). For reactions containing BCL-XLATM, 3 pIIVT served as 
substrate. To test kinase activity, 2 1g each of histone H1 (NEB) or myelin 
basic protein (MBP, Upstate Biotechnology) were used. Kinase reactions 
were incubated for 30 min at 37 °C, subjected to SDS-PAGE, blotted 
onto PVDF membrane (SERVA) and analysed by autoradiography using 
aphospho-sensitive imaging plate (Fujifilm). The same membrane was 
re-activated with methanol and further analysed by immunoblotting. 
For cleavage assays, 12.5-1l reactions were incubated with 1 pI separase 
(active P1127A variant or protease-dead (PD) C2029S variant)?", 1 U 
human caspase 3 (Enzo) or 20 U TEV protease, incubated for 30 min at 
room temperature (separase) or 37 °C (both others) and stopped by 
addition of SDS-sample buffer. 


Recombinant protein expression and purification 

To produce recombinant NEK2A-AMR (active or kinase-dead (KD, L37M), 
NEK2B-AMR and A86-cyclinA2 (in complex with endogenous CDK1/2), 
10 x 10’ Hek293T cells were transfected with the corresponding 
plasmids to express the kinases in fusion with a GFP-SUMOstar tag”, 
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supplemented with taxol 24-48 h thereafter and collected 12h later. 
Lysates in LP2 (including protease and phosphatase inhibitors) were 
cleared by centrifugation for 30 min at 16,500g and rotated for 4 hat 
4 °C with anti-GFP nanobody beads (0.1ml corresponding to 1mg nano- 
body). Immobilized kinases were washed three times in LP2 (200g, 1 
min, 4 °C), transferred into cleavage buffer (10 mM Hepes-KOH (pH 7.7), 
50 mM NaCl, 25 mM NaF, 1mM EGTA, 20% glycerol) and incubated with 
40 pg His,-SUMOstar protease” for 45 min at 18 °C. Eluates (80 pl) were 
snap-frozen and stored in aliquots at —80 °C. His,-SUMO1-MCLIATM 
was expressed froma pET28-derivative in Escherichia coli. Rosetta 2 DE3 
(Novagen). Bacteria were lysed in LP1 (PBS, 5 mMimidazole, 0.5 mM DTT 
and an additional 400 mM NaCl) and purified over Ni?*-NTA-agarose 
(Qiagen) according to standard procedures. The eluate in PBS sup- 
plemented with 250 mM imidazole, 0.5 mM DTT and an additional 400 
mM NaCl (pH adjusted to 7.5 with HCl) was dialysed against LP1 for12h 
at 4 °Cin the presence of 10 ng His,-Senp2” per 100 pg MCL1 and then 
rotated for 3 h over 9/10th the amount of fresh Ni?*-NTA-agarose. The 
flow-through containing pure MCLIATM was dialysed against PBS and 
stored in aliquots at -80 °C. 


Flowcytometry 

To measure apoptosis in vivo, Hela-K cells transfected with correspond- 
ing siRNAs and plasmids were collected by gentle trypsination, washed 
once with medium, and then stained with annexin V-FITC and propid- 
ium iodide according to the manufacturer’s protocol (Abcam, ab14.085). 
Samples were analysed immediately on a Cytomix FC 500 (Beckman 
Coulter) using an FL1 signal detector for FITC and FL2 for propidium 
iodide and counting at least 20,000 single cells per condition. Flow 
cytometry of propidium iodide-stained cells was doneas described™. 


Statistics and reproducibility 

No statistical methods were used to predetermine sample size. Experi- 
ments analysed by immunoblotting or autoradiography were repeated 
2-4 times with similar results (2-4 biological replicates). For quantita- 
tive analyses of chromosome spreads, clonogenic assays, and IFM speci- 
men, the investigators were blinded to sample allocation. Otherwise, 
experiments were not randomized. 


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 

The data that support the findings of this study are available within 
the paper. Source Data for Figs. 1-4 and Extended Data Figs. 1-12 are 
provided with the paper. Data or other materials are available from the 
corresponding author upon reasonable request. 
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Extended Data Fig. 1| Premature activation of separase rather than loss of 
cohesion triggers DiM. a, Premature sister chromatid separation (PCS) was 
quantified by chromosome spreading from siRNA-transfected HeLa-K cells at 
different times after release from RO-3306/G2 arrest. b, Immunoblots of cells 
froma. c-e, HeLa-K cells transfected with the indicated siRNAs and cultured in 
the presence of SiR-Hoechst* and a fluorogenic caspase 3/7 reporter® were 
analysed by video fluorescence microscopy. Shown are representative stills 


(c; scale bar, 10 tm), immunoblots (d), and cell fate profiles (e). f-h, HeLa-K cells 
expressing the caspase 3 reporter ZipGFP** were transfected with the indicated 
siRNAs, supplemented with SiR-Hoechst and followed by video microscopy. At 
least 100 cells were counted per time point and condition. Shown are 
representative stills (f), immunoblots (g), and line graphs (h) of the 
percentages of GFP-positive (apoptotic) cells. The ZipGFP plasmid also 
expresses mCherry as acontrol. Scale bar, 10 pm. 
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Extended Data Fig. 2 | Characterization of MCL1 (and BCL-XL) cleavage by taxol-arrested Hek293T cells transfected with siRNAs and expression plasmids 
separase (and caspase 3). a, Sequence stretches of wild-type BCL-XLandMCL1 _asindicated. During separase-triggered DiM, MCL1 cleavage stimulates BCL-XL 
and TEVvariants thereof. Differing amino acids are in bold. Arrowheads show cleavage and vice versa, which—at least in the case of MCL1—is mediated by the 
protease cleavage sites. b, Separase and caspase 3 cleave MCL1 after Arg176 corresponding N-terminal fragment (Extended Data Fig. 4b). e, Co-depletion of 
and Asp157, respectively, as mapped by in vitro-expressed fragments. SGO2 and securin induces cleavage of MCL1and BCL-XL followed by apoptosis 
Following incubation with NEK2A, CDK1-cyclin B1, PLK1and ATP, separase, or innon-transformed cells. Immunoblots of siRNA-transfected, taxol-arrested 
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NEK2A/separase-treated mouse *°S-MCL1. PD, protease-dead (C2029S); NIH/3T3s cells transfected with siRNAs and treated with taxol and 


Ac, active (P1127A); ATM, transmembrane domain deleted. d, Immunoblots of staurosporine as indicated were analysed by immunoblotting. 
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Extended Data Fig. 3 | Characterization of separase cleavage fragments of 
MCL1and BCL-XL. a, The C-terminal separase cleavage fragments of MCL1and 
BCL-XL preferentially bind BIM and BAD, respectively. Experimental setup and 
immunoblots of the indicated immunoprecipitation from taxol-arrested 
Hek293T cells co-expressing His,-MCL1-CATM, BCL-XL-CATM-Flag, HA-BAD, 
and HA-BIM. b, BCL-XL and BCL-XL-C bind BAK and BAD, respectively. 
Immunoblots of Flag immunoprecipitation from transfected, mitotic 
Hek293T cells expressing the indicated Flag-tagged BCL-XL fragments 
together with HA-tagged BAK and BAD. c, MCLI1-N interacts with BAK. 
Experimental setup and immunoblots of lysate and consecutive Flag and His, 
immunoprecipitations from transfected, mitotic Hek293T cells co-expressing 
Flag-TEV-BAK, His,-BAK, and MCL1-N. TEV protease supplementation of lysate 
served as a negative control. Self-interaction of BAK is mutually exclusive with 
binding of MCLI-N. d, Separase-induced DiM is suppressed by knock-down of 


MCL1iand BCL-XL. Quantification and representative images of 
siRNA-transfected, mitotic Hek293T cells cultivated in the presence of 
mitoTracker anda fluorogenic caspase 3/7 reporter before fixation and 
Hoechst staining. At least 100 cells each were counted. Scale bar, 5 um. 

e, Immunoblots of cells from d. f, MCL1-N and -C promote DiM. Plot of early 
(dark grey) and late (light grey) apoptosis as judged by flowcytometric analysis 
of propidium iodide and annexin V staining of Hek293T cells transfected with 
siRNAs and expression vectors for transgenic MCL1 (fragments) and 
supplemented with taxol and BCL-XL inhibitor WEHI-539 (WEHI) as indicated. 
g, Representative 2D scatter plots of cells from f and their interpretation. 

h, Induction of DiM by MCL1-C does not require taxol treatment but does 
require Thr301. Cell fate profiles of HeLa-K cells expressing the indicated MCL1 
fragments and cultured in the presence of SiR-Hoechst anda fluorogenic 
caspase 3/7 reporter. i, Immunoblot of cells from h and Extended Data Fig. 10a. 
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Extended Data Fig. 4| MCL1-N enables BAD to replace BAK as an interactor 
of BCL-XL and enhances cleavage of BCL-XL by separase. a, MCL1-N causesa 
switch in the binding partner of BCL-XL from BAK to BAD. Immunoblots of 
BCL-XLimmunoprecipitation from MCL1-depleted or control-treated, mitotic 
Hek293T cells co-expressing HA-BAK, HA-BAD and, where indicated, various 
MCL1 fragments. Separase was not deregulated in this experiment, whichis 
why BCL-XL is not cleaved. b, Separase-dependent BCL-XL cleavage in cells is 
primarily stimulated by MCL1-N. Immunoblots of SGO2/securin/MCL1 
triple-depleted or control-treated, prometaphase Hek293T cells expressing 
the indicated transgenic MCL1 variants. c, Separase prefers BCL-XL in complex 
with BAD asa substrate rather than BAK. Experimental setup and immunoblots 


MOMP 


of HAimmunoprecipitation from taxol-arrested Hek293T cells expressing 
NEK2A-AMR and either HA-BAK or HA-BAD plus MCL1-N. Before SDS-PAGE, 
samples were incubated with inactive (PD) or active (ac) separase or control 
treated (—).d, MCL1-N promotes apoptosis by a positive feedback mechanism. 
MCL1-N competitively displaces the BH4 domain of BCL-XL from BAK. Similar 
to cleavage by separase, this breaks the cooperative binding of BCL-XLto BAK, 
resulting in BH3-only proteins, suchas BAD, excluding BAK from BCL-XL. At the 
same time, this renders BCL-XL a better separase substrate. MCLI-N acts 
catalytically, being released from BAK upon self-interaction and pore 
formation by the latter (dotted arrow; Extended Data Fig. 3c). 
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Extended Data Fig. 5| Self-interaction of pro-apoptotic MCL1-C shares 
characteristics with pore formation by BAK/BAX but requires 
mitosis-specific phosphorylation. a—c, MCL1-C exhibits mitosis-specific 
self-interaction, which requires the transmembrane domain and an accessible 
BH3-binding groove. Immunoblots of Flag immunoprecipitation from mitotic 
or interphase Hek293T cells expressing MCL1 variants and supplemented with 
the MCL1inhibitor A-1210477 as indicated. Blockade of the MCLI-BAK 
interaction served asacontrol for the effectiveness of A-1210477. d, The 
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homotypic interaction of MCLI1-C is mutually exclusive with BIM binding. Top, 
experimental setup; bottom, immunoblots of consecutive Flag and His, 
immunoprecipitations from Hek293T cells expressing HA-BIM together with 
Flag-TEV- and His,-tagged forms of either wild-type MCL1-C or the T301A 
variant. X, irrelevant lane between HA-BIM control (asterisk) and His, 
immunoprecipitation samples. The T301A mutation prevents self-interaction 
of MCL1-C but notits association with BIM. 
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Extended Data Fig. 6 | BAK/BAX-independent release of cytochrome c by 
separase deregulation or MCLI-C expression. a, b, Time-resolved 
immunoblots of cytosol and mitochondria-containing fractions from SGO2/ 
securin-depleted or mock-transfected BAK”, BAX” and parental HCT116 cells 
released froma G2-arrest at t=O min.c, Immunoblots of taxol-arrested BAK”, 
BAX* and parental HCT116 cells that were transfected with siRNA and 
supplemented with staurosporine as indicated. Note the absence of MCL1land 


BCL-XL cleavage during staurosporine-induced apoptosis and suppression of 
siSGO2/siPTTG1-induced DiM by co-depletion of MCL1and BCL-XL. Grey lines 
within panels are between lanes that were not directly juxtaposed but 
nevertheless originated from the same gel. d, Time-resolved immunoblots of 
cytosol and mitochondria-containing fractions from Flag—MCL1-C-expressing 
BAK™, BAX“ and parental HCT116 cells released from G2-arrest at t=O min. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | The pro-apoptotic effect of MCL1-C and BAX requires 
anegative charge at the end of a-helix 6. a, b, Immunofluorescence 
micrographs of taxol-arrested Hek293T cells transfected with the indicated 
siRNAs. Shown are maximum projections of 20 z-stacks (a) or asingle, 
deconvoluted plane (b). The interruption of TOM20 rings by MCL1 dots is 
consistent with a cross-section through a fragmented mitochondrion 
containing an MCLIring. Scale bars, 3 4m.c, MCL1-Cis likely to form 
macropores into the mitochondrial outer membrane. Immunofluorescence 2D 
SIM of SGO2/securin-depleted Hek293T cells undergoing DiM. Note the 
absence of the mitochondrial outer membrane marker TOM20 fromthe 
centres of MCLI rings. Scale bar, 0.5 um. d, Immunoblots of extracts and Flag 
immunoprecipitations from interphase or mitotic Hek293T cells expressing 
Flag-tagged MCL1-C-WT (T), MCL1-C(T301A) (A), or MCL1-C(T301E) (E). 

e, Identifying serine and threonine residues that affect the pro-apoptotic 
nature of MCLI1-N and MCL1-C. Immunoblot of MCL1-depleted, taxol-arrested 
Hek293T cells expressing the indicated siRNA-resistant variants of MCL1-N or 
MCL1-C. BCL-XL is not cleaved during apoptosis if separase remains inhibited. 


f,g, AuroraB kinase phosphorylates MCL1-C in vitro, probably at position 301 
primarily. Immunoblots and autoradiographs of in vitro-translated wild-type 
MCL1-CATM and variants thereof after incubation with the indicated kinases 
and inhibitors in the presence of y*P-ATP. Activity of the recombinant kinase 
was confirmed using model substrates (f, lower panels). KD, kinase-dead; RO, 
RO-3306; BI, BI-2536; ZM, ZM-447439; MBP, myelin basic protein; H1, histone 
H1.h, Local alignment of MCL1and BAX. Vertical lines and colons mark 
identical and chemically similar residues, respectively; dashes represent gaps. 
i, Immunoblots of interphase or mitotic Hek293T cells expressing BAX with (+) 
or without (A) TM and with Glu (E), Ala (A), or Thr (T) at position 146.j, Model of 
MOMP by MCL1-C homo-oligomerization. The indicated conformational 
changes are inspired by knowledge about BAK/BAX pore formation and their 
hierarchy was chosen to best explain our data; that is, why the T301A mutation 
abolished self-interaction of MCL1-C but left BIM binding unaffected 
(Extended Data Fig. 5d). Minus signs represents phosphorylated Thr301and 
plus signs represent a nearby basic residue. 
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Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | NEK2A kinase turns MCL1 and BCL-XL into separase 
substrates. a, Autoradiographs and immunoblots of combined kinase 
@P-labelling) and cleavage (fragment-generation) assays using bacterially 
expressed, purified MCLIATM and kinases, specific inhibitors and active (ac) or 
inactive (PD) separase, as indicated. b, CDK1-cyclin Bland PLK1 enhance 
separase-dependent cleavage of NEK2A-phosphorylated MCL1. Immunoblots 
of cleavage assays using bacterially expressed, purified MCLIATM and the 
indicated combination of kinases and separase variants. c, When 
immunoprecipitated from NEK2A-AMR-expressing, SAC-arrested cells, 
endogenous MCL1is efficiently cleaved by separase in vitro. Left, experimental 
setup; right, immunoblots of cleavage assay combining active or inactive 
separase with MCL1immunoprecipitation from prometaphase Hek293T cells 
expressing MCLIATM-Flag and, where indicated, Flag-NEK2A-AMR or 
N-terminally truncated (A86) Flag-cyclin A2. d, Cleavage of mouse (M.m) MCL1 
by separase also requires NEK2A-dependent phosphorylation. 

In vitro-translated °S-MYC6-M.m.MCLIATM was incubated with separase, 


caspase 3 and NEK2A/ATP as indicated. Reactions were resolved by SDS-PAGE 
and analysed by autoradiography and immunoblotting. KD, kinase-dead 
(K37M).e, NEK2A and (toa lesser extent) CDK1/2-cyclin A2 sensitize BCL-XL to 
separase. Autoradiography of cleavage assay combining in vitro-translated 
35§-RAD21 (positive control) or °S-BCL-XLATM with kinases/ATP and separase 
variants as indicated. f, Autoradiography of kinase assays (*°P-labelling) using 
model substrates and the kinases from d. g-i, The NEK2A-related NEK2B does 
not support separase-dependent MCLI cleavage. g, Schematics and C-terminal 
sequences (dashed box) of NEK2A and NEK2B, which arise by alternative 
splicing of the same gene™. NEK2A-specific, C-terminal degrons (KEN box and 
MR-tail) are underlined. h, Both NEK2A and NEK2B can phosphorylate the 
model substrate MBP. i, NEK2B cannot phosphorylate MCL1. Left, 
experimental setup; top right, kinase assay; bottom right, cleavage assay 
combining MCLIATM with NEK2A, NEK2B, staurosporine (kinase inhibitor) 
and active separase as indicated. 
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Extended Data Fig. 9 | Mapping cleavage-relevant phosphorylation sites 
within MCL1and BCL-XL. a, b, Cleavage of MCL1 by separase essentially 
requires NEK2A-dependent phosphorylation of Ser60 and Thr163. 
Autoradiographs and immunoblots of combined kinase (?P-labelling) and 
cleavage (fragment-generation) assays. Prior to analysis, in vitro-translated, 
wild-type MCLIATM and variants thereof were incubated with active NEK2A, 
y°P-ATP, staurosporine, and active (ac) or inactive (PD) separaseas indicated. 
In vivo phosphorylation of Ser159 and Thr163 has previously been reported>°. 
In vivo phosphorylation of Ser60 was detected by a phosphorylation-specific 
antibody (Fig. 4c, Extended Data Fig. 11d, e).c, Separase-dependent cleavage 
of MCL1(S/T60,159,163D/E) is independent of NEK2A. Immunoblots and 


Nek2a + Cdk1/2-Cyc.A2: + + 


autoradiography of combined kinase (’P) and cleavage assay. d-f, In 
vitro-translated, *S-labelled wild-type BCL-XLATM and variants thereof were 
incubated with the indicated kinases (+) or reference buffers (-) and active 
separase before SDS-PAGE and autoradiography. d, NEK2A-stimulated 
cleavage of BCL-XL by separase essentially requires Ser4 and Serl64. e, CDK1/2- 
cyclin A2-stimulated cleavage of BCL-XL by separase essentially requires 

Ser62. In vivo phosphorylation of Ser62 has been previously reported**. 

f, Separase-dependent cleavage of BCL-XL(S4,62,164D) occurs independently of 
NEK2A and CDK1/2-cyclin A2. g, Autoradiograph and immunoblot of kinase assay 
using in vitro-translated, wild-type BCL-XLATM or its S4,62,164A variant and 
y°P-ATP in combination with the indicated kinases (+) or reference buffers (-). 
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Data exclusions No data were excluded. 

Replication The values obtained from distinct experimental trials were reproducible. The data are presented as means together with the corresponding 
individual data points from each repetition to indicate biological variation. Experiments analysed by immunoblotting were repeated 2-4 times 
with similar results (2-4 biological replicates). 


Randomization n/a 


Blinding For quantitative analyses of chromosome spreads and IFM specimen the investigators were blinded to sample allocation. 
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Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Rabbit anti-separase (1.5 ug/ml) see Ref. in Material and Methods, mouse anti-Flag M2 (1:2,000; Product No. F1804; Sigma- 
Aldrich), rabbit anti-Sgo2 (1:1,000; Product No. A301-262A; Lot. No. A301-262A-1; Bethyl), rabbit or guinea pig anti-Sgo2 (1.5 ug/ 
ml; raised by Charles river laboratories against the peptide: DVPPRESHSHSDQSSKC), rabbit anti-Sgo1 (1:500, Abcam ab21633), 
mouse anti-securin (1:1,000; clone DCS-280; Code No. KO09-3; MBL), rabbit anti-phosphoSer10-histone H3 (1:1,000; Product No. 
06-570; Lot No. 2370127; Millipore), mouse anti-cyclin B1 (1:1,000; Product No. 05-373; Lot. No. 2199734; Millipore), rabbit 
anti-Pin1 (1:1,000) see Ref. in Material and Methods, rabbit anti-caspase 3 Asp175 (1:1,000; clone 5A1E; Product No. 9664; Cell 
Signaling), rabbit anti-Bcl-xL (1:1,000; Product No. 2762; Cell Signaling), mouse anti-Mcl1 (1:800; clone W16014A; Product No. 
695702; BioLegend), guinea-pig anti-Mcl1 (1 ug/ml; raised by Charles River Laboratories against bacterially expressed human 
Mcl1ATM), guinea pig anti-phosphoSer10-Mcl1 ('Mcl1-pS60'; 0,5 ug/ml; for IFM 1 ug/ml; anti-CVIGGpSAGA liberated from 
reactivity towards CVIGGSAGA), rabbit anti-Sororin (1 ug/ml; raised by Charles River Laboratories against bacterially expressed 
full-length human sororin), rabbit anti-PARP (1:800; clone 46D11; Product No.9532; Cell Signaling), mouse anti-PARP (1:1,000; 
clone C-2-10; Product No. AM30; Calbiochem), mouse anti-Tom20 (1:500; Santa-Cruz Biotechnology, F10), mouse anti- 
cytochrome c (1:1,000; BD Pharmingen, 7H8.2C12), mouse anti-BubR1 (1:1,000; BD Transduction Laboratories, clone 9), rabbit 
anti-Bax (1:1,000; Abcam, ab32503), rabbit anti-Bak (1:1,000; Abcam, ab32371), rabbit anti-Phosphothreonine (1 ug/ml; Product 
No. 71-8200; Invitrogen), rabbit anti-Mad2 (1:1,000; Bethyl, A300-300A), anti-MBP monoclonal (1:1,000; NEB Biolabs; HRP- 
conjugated; Product No. E8032), mouse anti-Nek2 (1:600; clone 20/Nek2 Ruo; Product No. 610593; BD Transduction 
Laboratories), goat anti-Cdc27 (1:1,000) see Ref. in Material and Methods, mouse anti-RGS-His (1:1,000; Product No. 34610; 
Qiagen), rabbit anti-phosphoSer139-histone H2A.X (yH2AX; 1:5,000; clone EP854(2)Y; Product No. MABE205; Lot No. 2034733; 
Millipore), mouse anti-topoisomerase Ila (1:1,000; clone 1C5; Product No. ADI-KAM-CC210-E; Enzo Life Sciences), mouse anti- 
cyclin A2 (1:200; clone 46B11; Product No. sc-53234; Santa Cruz Biotechnology), rat anti-HA (1:2,000; clone 3F10; Product No. 
11867423001; Roche), mouse anti-GFP (hybridoma supernatant 1:2,000; gift from D. van Essen and S. Saccani), and mouse anti- 
a-tubulin (hybridoma supernatant 1:200; DSHB; clone 12G10), rabbit anti-murine Pttg1 and anti-murine SgoL2 sera (both 
1:1,000; gift from Alberto M. Pendas). 


Validation 


Eukaryotic cell lines 


Validation procedures used for commercial antibodies are described by the manufacturer. Self-made polyclonal rabbit or guinea 
pig antibodies are validated using multiple methods (e. g. ICC/IF, WB or IP). To address antibody specificity we used the 


corres 


ponding antigen (e.g. recombinantly expressed in E. coli, IVTT or cell lysate) as poitive and siRNA depleted lysate or fixed 


cells as negative control. 


Policy information about cell lines 


Cell line source(s) 


Authentication 


Mycoplasma contamination 


Commonly misidentified lines 
(See ICLAC register) 


He 


293T (Gift from Marc W. Kirschner); HeLa-K (Gift from Thomas U. Mayer); RPE-1 hTERT (ATCC, CRL-4000); HCT116 


parental and Bak/Bax DKO (Gift from Richard Joule); SW480, T47D, A427 and A549 (Gift of Michael Orth from); NIH3T3 
purchased from ATCC (CRL-1658); TP53 KO (Gift from Bert Vogelstein); Xenopus laevis $3 (Gift from Guowei Fang) 


Ce 


| were authenticated via visual inspection of typical morphology, by immunoblotting analyses and cell synchronization 


behavior, and through selective resistance to antibiotic treatments. 


Ce 


| lines were not tested for mycoplasma contamination but microscopic inspections of their fluorescently labeled DNA 


contents were inconspicuous. 


No 


commonly misidentified cell lines were used. 
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In the Acknowledgements of this Article, the text “and the Quantum 
Flagship PASQUANS” should be replaced with the sentence “This project 
(or publication) has received funding from the European Union’s Hori- 
zon 2020 research and innovation programme under grant agreement 
no. 817482 (PASQuanS).’ The original Article has been corrected online. 
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In Fig. 1b of this Article, owing to errors in the production process, there 
is an incorrect P value in the left-hand graph of this panel. The value 
‘P<10™, corresponding to the comparison between control and 
RIPK1'®, should be ‘P= 2 x 10. In addition, the label ‘RIPK1-*” was 
duplicated in Fig. 1c. These errors have been corrected online. 
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Shyuan Ngo and her lab members work in rotating groups to maintain distance. 


KEEPING ON WITH 
ESSENTIAL SCIENCE 


For scientists working on urgent research, 
staying home is not an option. By Virginia Gewin 


round the world, universities have 
closed because of COVID-19 — forcing 
an increasing number of researchers 
to work and teach from home. 

Some scientists can’t simply stop 
going to their laboratories — especially not 
those who are overseeing clinical trials that 
could offer life-saving vaccines and therapies, 
particularly against the new coronavirus. And 
some research activities unrelated to vaccine 
production must continue, even in the face of 
aninstitutional shutdown. 

“Animals need to be looked after, and breed- 
ing lines must be kept going. Many of these are 
unique and can’t be regenerated,” says Mike 
Turner, director of science at Wellcome, a 
research-funding charity in London. 

If you have to go to your lab, says Turner, 
comply with all of your institution’s safety 


regulations, not just those for preventing 
COVID-19. For example, stick to the ‘buddy 
system’ — by working in groups of at least 
two people — when necessary for your safety. 
Many current guidelines advise a person not 
to come within 2 metres of anyone for longer 
than 15 minutes. 

Turner also suggests creating schedules 
to make sure the essential tasks of each lab 
at an institution get done if COVID-19 strikes 
among the members in any one of the labs. 

“If someone gets ill, they know who to 
tell, and that person knows it is their role to 
continue the task,” he says. “If your work is 
contributing to [research against] COVID-19, 
we applaud you, and please carry on. Ifit’s not, 
ask, ‘Is it absolutely essential?” 

Here, four scientists offer advice on 
the precautionary measures necessary to 


© 2020 Springer Nature Limited. All rights reserved. 


continue essential research in the face of the 
pandemic. 


JOHN MORRISON 
REDESIGN STUDIES TO BE 
LESS LABOUR-INTENSIVE 


The status of the lab changes every day. In 
California, we have a statewide order to 
‘shelter in place’, or to stay home except to 
do essential tasks, such as seeking health 
care or buying groceries. My colleagues and 
l are exempt from that order because we are 
beginning aggressive COVID-19 research in 
collaboration with the Center for Immunol- 
ogy and Infectious Diseases at the University 
of California, Davis, and we care for animals. 

Weare exercising an extreme form of social 
distancing while at work. It’s complicated for 
our group. We have to worry about the health 
of our people — our primary concern -— as well 
as the health of the primate colony we use for 
research. We have to assume that there could 
be human-monkey transmission and vice 
versa. And we have to maintain the colony. 

We are not starting any new protocols or 
studies beyond COVID-19 research, and are 
doing what we can to keep existing research 
protocols going. We decided to give prefer- 
ence to longitudinal studies, to make sure data 
collected previously remain meaningful. We 
are asking people to redesign their studies to 
require 50% less labour while salvaging the 
study and making it rigorous and reproduci- 
ble. It can be terribly difficult to decide which 
studies must have data collection stopped. 
It’s important to keep the measures going that 
yield the biggest bang for the buck, or the most 
information for the least effort and lowest 
cost. For example, can you collect samples 
less often without compromising the validity 
of the study? In some instances, the smartest 
thing to do might be to accelerate the study’s 
completion to get it out of the way. 

We have 300 people here, and are reducing 
our on-site staff by at least 50%. But we are the 
only primate-research centre growing the new 
coronavirus to help develop clinical tests. Peo- 
ple are working from home, staggering their 
shifts and spreading out across labs. What gets 
hard is launching new COVID-19 research with 
our reduced capacity. And it became very clear 
that most of the national primate-research 
centres are initiating COVID-19 research as a 
high priority. We’re doing that because the 
monkey model for the disease will be extraor- 
dinarily powerful, and we need it yesterday. 
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Work / Careers 


Protecting our workforce is our highest 
priority, followed closely by helping to fight 
this pandemic while keep our monkeys safe. 


John Morrison is director, California National 
Primate Research Center at the University of 
California, Davis. 


VIJIVUAYAN 
COLLECTIVELY SACRIFICE 
TO KEEP WORK GOING 


Our university has not yet considered shutting 
down, but we encourage those who can to 
work from home. For those continuing wet- 
bench lab work, we follow the clear advice 
that we receive from Singapore’s Ministry of 
Health and our university administrators on 
social-distancing guidelines and on specific 
directives, such as a 14-day self-quarantine for 
people arriving from certain countries. For- 
tunately, the university already had a robust 
biorisk-management system in place, which 
made it easier to beef up our response to the 
COVID-19 outbreak. We are now taking pre- 
cautions to make sure work goes on, because 
not everyone can work remotely. 

During the first couple of days of the 
outbreak, our infectious-disease researchers 
were working long hours. We reduced those, 


“Our work isn’t about 

academic credit right now — 
it’s about what we can do for 
our country and the world.” 


because it is not safe to work while fatigued. 
We now make sure that working hours are 
within acceptable limits so that people don’t 
get tired — and have time to take breaks. 

There are about 400 researchers at our 
medical school across roughly 50 wet-bench 
labs. Roughly 20% of them work on infectious 
disease, and the others work on cancer, meta- 
bolic disorders and neurobehaviour. We have 
thinned our workplace staff by reducing the 
number of people working in labs by about 
40-50%. Labs have broken into teams, and 
have adopted a variety of strategies to distance 
workers in both time and space. 

Our biosafety-level 3 (BSL-3; level 4 is the 
most strict) lab is small, so researchers work- 
ing in that space have split into early-morning 
and late-afternoon shifts. In other labs, teams 
either switch days of each week — working 
Monday to Wednesday or Thursday and Fri- 
day — or split into different floors of the build- 
ing, depending on what works best for their 
research. That way, if one group is quaran- 
tined, the other group can take over. The teams 
each wear differently coloured stickers all the 
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time to avoid members from other teams. 

In addition, given the increased demand for 
the small BSL-3 space, researchers can make 
non-infectious coronavirus by extracting its 
RNA so they can work with it, when feasible, 
in the less-stringent BSL-2 lab environments. 

Continuing lab experiments is possible, but 
not easy. It’s a collective sacrifice to make big 
changes inthe way you work and live, but these 
are absolutely essential to combat this out- 
break. Our biggest asset is our people at work. 


Viji Vijayan is associate dean of safety and 
emergency management, Duke-National 
University of Singapore Medical School. 


SHYUANNGO 
LET ETHICS GUIDE WHICH 
RESEARCH T0 CONTINUE 


My lab of ten includes postdoctoral research- 
ers, PhD students and research assistants. 
Normally, we do natural-history studies with 
people with motor-neuron disease (MND) in 
aclinical setting, and take everything welearn 
back tothe preclinical laboratory, using mouse 
models of MND and stem cells from people 
with the condition. Right now, the University 
of Queensland remains open, although it 
recently announced a move to teaching online 
and advised against non-essential travel. 

Research-group leaders were asked to 
develop contingency plans to reduce the num- 
ber of people in their labs. My labis one of the 
few on campus that has adopted those plans. 
We shut down clinical research because our 
participants are at risk of respiratory distress 
from COVID-19. We also stopped projects that 
were initiated inJanuary, choosing instead to 
focus our efforts on projects that were closer 
to completion. It would be unethical to cull 
animals that we have already been working 
with for almosta year. Similarly, we’re invested 
in making sure we get all the data necessary 
from stem cells grown from people with MND, 
because they have donated their time and 
invested in the project. And this research was 
funded by publicly donated money — another 
reason not to shut it down. For safety, we div- 
vied the lab into two groups, each composed of 
four individuals. Within each team, we rotate 
the schedule, leaving a30-minute gap between 
shifts, so that people never cross paths in the 
lab. We do this throughout the open space we 
share with three other labs. The stem-cell work 
happens in the early morning and late after- 
noon, so that the animal team can continue 
collecting samples at the same time it hasbeen 
doing every day, for continuity. 

Consult with your whole team on how to 
prioritize projects and schedules and involve 
everyone in decision-making, to avoid a sense 
of uncertainty and disgruntlement. We don’t 
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need that right now. We need to all work 
together. 


Shyuan Ngo is an MND researcher at the 
University of Queensland in Brisbane, Australia 


MARK DENISON 
COMMUNICATE CONSTANTLY 


Istarted working on coronaviruses in1984, and 
have worked through severe acute respiratory 
syndrome (SARS) and Middle East respiratory 
syndrome (MERS).1Iamsobered and subdued 
by the reality of the COVID-19 pandemic. Our 
team’s priority is work on countermeasures. 

We are conducting in vitro studies of poten- 
tial antiviral drugs including remdesivir, alone 
or in combination with other compounds, 
while continuing our long-standing collabo- 
ration with Ralph Baric’s lab at the University 
of North Carolina in Chapel Hill. Theat group 
is developing animal models to test these 
potential COVID-19 therapies. We expect to 
continue participating in collaborations to 
test vaccines. My goals are making sure noth- 
ing hampers the speed of vaccine trials, and 
identifying any drug combinations that could 
have a high impact in mitigating the disease. 

Our work isn’t about academic credit right 
now — it’s about what we can do for our country 
and the world. We want to contribute swiftly, 
so I’m working out how to do that with what 
we have. Already, the university has recog- 
nized our essential work and commitment to 
safety, and has allowed postdocs and graduate 
students to continue working in our lab. 

Ihave a12-member team; 6, including me, are 
trained to workinaBSL-3 lab. We work long days, 
seven days a week. I try not to do that, but peo- 
pleinthe emergency rooms in Italy are working 
more than that — and they don’t havea choice. 

At the same time, we can’t get so tired that 
we make mistakes. We can’t work all the time. 
We take steps to protect ourselves and each 
other. Everyone absolutely has to report any 
symptoms of any kind — from an earache toa 
runny nose. We are in constant contact. 

To keep everyone more than 2 metres apart, 
we have got people working in other labs that 
shut down. We conduct our staff meetings by 
talking across the halls from individual labs, 
or through the videoconferencing platform 
Zoom. If one of us gets COVID-19, the whole 
research programme could shut down. 


Mark Denison is director of paediatric 
infectious diseases, Vanderbilt University 
Medical Center, Nashville, Tennessee. 


Interviews by Virginia Gewin 
These interviews have been edited for length 
and clarity. 
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Work / Technology & tools 


. 


DEEP LEARNING 
TAKES ON TUMOURS 


Artificial-intelligence methods are moving 
into cancer research. By Esther Landhuis 


scancer cells spreadinaculture dish, 
Guillaume Jacquemet is watching. 
The cell movements hold clues to 
how drugs or gene variants might 
affect the spread of tumours in 
the body, and he is tracking the nucleus of 
each cell in frame after frame of time-lapse 
microscopy films. But because he has gener- 
ated about 500 films, each with 120 frames 
and 200-300 cells per frame, that analysis 


is challenging to say the least. “If 1 had to do 
the tracking manually, it would be impossi- 
ble,” says Jacquemet, a cell biologist at Abo 
Akademi University in Turku, Finland. 

So he has trained a machine to spot the 
nuclei instead. Jacquemet uses methods avail- 
able ona platform called ZeroCostDL4Mic, 
part of agrowing collection of resources aimed 
at making artificial intelligence (AI) technol- 
ogy accessible to bench scientists who have 
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minimal coding experience’. 

Al technologies encompass several 
methods. One, called machine learning, uses 
data that have been manually preprocessed 
and makes predictions according to what the 
Allearns. Deep learning, by contrast, caniden- 
tify complex patterns in raw data. It is used in 
self-driving cars, speech-recognition software, 
game-playing computers — and to spot cell 
nuclei in massive microscopy data sets. 
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Work / Technology & tools 


Deep learning has its origins in the 1940s, 
when scientists built a computer model that 
was organized in interconnected layers, like 
neurons in the human brain. Decades later, 
researchers taught these ‘neural networks’ to 
recognize shapes, words and numbers. But 
it wasn’t until about five years ago that deep 
learning began to gain traction in biology and 
medicine. 

Amajor driving force has been the explosive 
growth of life-sciences data. With modern 
gene-sequencing technologies, a single exper- 
iment can produce gigabytes of information. 
The Cancer Genome Atlas, launched in 2006, 
has collected information on tens of thou- 
sands of samples spanning 33 cancer types; 
the data exceed 2.5 petabytes (1 petabyte is 
I million gigabytes). And advances in tissue 
labelling and automated microscopy are 
generating complex imaging data faster than 
researchers can possibly mine them. “There’s 
definitely a revolution going on,” says Emma 
Lundberg, a bioengineer at the KTH Royal 
Institute of Technology in Stockholm. 


Boosting image-based profiling 


Cancer biologist Neil Carragher caught his 
first glimpse of this revolution in 2004. He 
was leading a team at AstraZeneca in Lough- 
borough, UK, that explores new technologies 
for the life sciences, when he came across a 
study that made the company rethink its 
drug-screening efforts. He and his team had 
been using cell-based screens to look for prom- 
ising drug candidates, but hits were hard to 


WANTED: 
MORE DATA 


Deep-learning models can process raw 
data, but first they must be trained with 
annotated information. 


It takes vast amounts of labelled data to 
train deep-learning models. But that’s not 
always easy to come by, says Casey Greene, 
a computational biologist at the University 
of Pennsylvania in Philadelphia. “Data are 
cheap, but labelled data are expensive.” 

In the genomics realm, sequences are 
abundant and publicly available. But their 
associated descriptions, or metadata, are 
often missing, wrong or unstandardized, 
says Emily Flynn, a doctoral candidate 
in biomedical informatics at Stanford 
University in California. A researcher wanting 
to train a model to detect non-small-cell 
lung cancer in samples from patients, for 
example, might well find data sets variously 
labelled ‘nsclc’, ‘non small-cell’ or ‘non small 
cell LC’ — differences that confound analysis 
tools. Or samples might be labelled ‘disease: 
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come by. The study was suggesting that Al 
and analytics could help them to improve 
their screening processes’. “We thought this 
could be asolution to the productivity crisis,’ 
Carragher says. 

But AI technologies can be difficult for 
biologists to master. Jacquemet says he 
once spent more than a week trying to 
install the correct software libraries to runa 
deep-learning model. Then, he says, “you need 
to learn to code in Python’ to use it. 

Carragher’s AstraZeneca team worked with 
computational biologist Anne Carpenter and 


“Ift had todothe 
tracking manually, 
it would be impossible.” 


her colleagues at the Broad Institute of MIT 
and Harvard in Cambridge, Massachusetts, to 
scale up the image-profiling method used in 
the 2004 paper and to investigate the effects of 
multiple drugs on human breast-cancer cells®. 
Carpenter went on to develop the technique 
into a procedure called Cell Painting, which 
stains cells witha panel of fluorescent dyes and 
then uses the open-source software CellPro- 
filer to generate profiles of the cells. 

Still, these analyses can be labour-in- 
tensive, says Carragher, who now heads 
cancer-drug discovery at the University of 
Edinburgh, UK. Even with open-source tools 
that avoided the need to code the machine 


glioblastoma’ and ‘disease: yes’, says 
biostatistician Colin Dewey at the University 
of Wisconsin-Madison. 

To help organize those data, Dewey 
created a computational pipeline called 
MetaSRA, which uses text-mining 
techniques to standardize and store 
metadata on public sequences. And Greene 
and colleagues have built refine.bio, 

a repository that harmonizes data on 
expression and RNA sequencing. Working 
with Stanford bioengineer Russ Altman, 
Flynn is using machine-learning techniques 
to infer missing labels from gene-expression 
data to improve annotations in refine.bio. 

In bioimaging, the problem lies more in 
annotation. To label a set of histopathology 
slides, for example, “someone has to go in 
and draw a bounding box around the parts 
that are cancer”, Greene says. “And that 
person probably makes a lot of money.” 
Now developers are training deep-learning 
algorithms to label nuclei and other 
structures in cell images, while the Image 
Data Resource and other online repositories 
are making it easier for researchers to share 
and find life-sciences images. 
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learning from scratch — and a computing 
cluster with thousands of processors and 
terabytes of memory — it could take amonth 
or soto work out which cellular features they 
should tell the image-analysis software to 
look at, Carragher says. And after optimiz- 
ing the parameters for each cell line, his team 
had to tinker further to get it to work across 
all cells. 

Last year, he and his team explored how 
deep learning could improve this process. 
The impetus was a 2017 analysis* posted on 
the bioRxiv preprint server by researchers 
at Google’s headquarters in Mountain View, 
California. The researchers had downloaded 
Carragher’s breast-cancer data set from the 
Broad Bioimage Benchmark Collection and 
used it to train a deep neural network that pre- 
viously had seen only general images, such as 
cars and animals. By scanning for patterns in 
the breast-cancer data, the model learnt to dis- 
cern cellular changes that are meaningful for 
drug discovery. Because the software wasn’t 
told what to look for, it found features that 
researchers hadn’t even considered. 

Building on that effort, Carragher and his 
colleagues screened 14,000 compounds 
across 8 forms of breast cancer’®. “We dididen- 
tify some interesting hits,” he says — includ- 
ing a compound that was already known to 
modulate receptors for serotonin, which is 
important in mammary-gland development, 
as they reported earlier this year®. 

At the Broad Institute, a team led by com- 
putational biologist Juan Caicedo is applying 
image-based profiling to screen for genetic 
mutations. He and his team overexpressed 
various gene variants in lung-cancer cells, 
stained them with the Cell Painting protocol 
and looked for differences in the cells that 
suggest possible pharmaceutical opportuni- 
ties. They found that machine learning could 
identify meaningful variants in images about 
as well as processes that measure gene expres- 
sioninthe cells. The researchers reported their 
results at the Al Powered Drug Discovery and 
Manufacturing Conference in February at 
the Massachusetts Institute of Technology in 
Cambridge. 

As part of the Cancer Cell Map Initiative, 
which maps molecular networks under- 
lying human cancer, researchers are 
training a deep-learning model to predict 
drug responses on the basis of a person’s 
cancer-genome sequence. Such predictions 
have life-or-death implications, and accuracy 
is crucial, says Trey Ideker, a bioengineer at 
the University of California, San Diego. But 
some are reluctant to accept results when 
the mechanisms behind them aren't clear, 
and deep neural networks produce answers 
without revealing their process — a problem 
known as ‘black-box’ learning. “You want to 
know why,’ says Ideker. “You want to know 
the mechanism.” Ideker’s team is creating 
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Cell nuclei (top, DNA stain) are automatically detected using the CellProfiler method (bottom). 


a ‘visible’ neural network, which links the 
model’s inner workings more directly to 
cancer cell biology. As a proof of concept, 
the team created a model for yeast cells. 
Called DCell, it can predict the effects of gene 
mutations on cell growth and the molecular 
pathways underlying those effects’. 


The spatial dimension 


Lundberg and others in Sweden are using deep 
learning to tackle another computational 
challenge: assessing protein localization. 
The work is part of the Human Protein Atlas, 
a multi-year, multi-omics effort to map all 
human proteins. Spatial information reveals 
where proteins are located in cells, and tend to 
be under-represented in systems-level studies, 
Lundberg says. But if researchers knew this 
information, they could use it to glean insights 
about the underlying biology, she suggests. 

Enter Al. In 2016, Lundberg and her 
colleagues invited gamers to help computers 
classify proteins’ whereabouts in cells. The 
citizen scientists took part ina role-playing 
game called EVE Online, in which they had to 
pinpoint fluorescently labelled proteins to win 
game credits, boosting an Al system already 
used for this purpose. But even the augmented 
system trailed human experts in terms of accu- 
racy and speed. 

So, in 2018, Lundberg’s team took its 
images to Kaggle — a platform that chal- 
lenges machine-learning experts to develop 
their best models to crack data sets posted by 
companies and researchers. Over the course 
of 3 months, 2,172 teams around the world 
competed to develop a deep-learning model 
that could look at a cell stained for a protein 
and several reference markers, and work out 
the protein’s spatial distribution. 

The task was challenging. Half of human 
proteins are found in multiple places in cells, 
says Lundberg. And some cellular compart- 
ments — the nucleus, for example — are much 
more common locations than others. 

Still, the Kagglers delivered, Lundberg 
says. Most of the leading strategies came 
from computational scientists with no biol- 
ogy background — including Bojan Tunguz, 
a software engineer who created models that 
predict earthquakes and loan defaults before 


earning one of the top spots in the Human 
Protein Atlas contest. The approach to these 
problems is similar across vastly different 
disciplines, Tunguz says. 

The best model identified both rare and 
common locations across a variety of cell 
lines and, most importantly, captured mixed 
patterns well, Lundberg says. The algorithm 
performed almost as accurately as human 
experts, and with greater speed and repro- 
ducibility. Furthermore, it could quantify 
the spatial information’. “When we can quan- 
tify it, and not just describe it with a label, 
we can integrate it with other types of data.” 
That includes ‘omics’ data, which are already 
transforming cancer research. 

A computational framework known as 
DeepProg applies deep learning to ‘omics’ data 
sets, including gene expression and epigenetic 
data, to predict patient survival, for instance’. 
And DigitalDLSorter predicts outcomes by 


“When we can quantify it, 
and not just describe it 

witha label, we canintegrate 
it with other types of data.” 


inferring types and quantities of immune cells 
directly from tumour-RNA sequencing data 
rather than relying on laborious conventional 
workflows”®. 


Onthe horizon 


Many of the tools needed to build 
deep-learning models are freely available 
online, including software libraries and cod- 
ing frameworks suchas TensorFlow, Pytorch, 
Keras and Caffe. Researchers wanting to ask 
questions and brainstorm solutions to prob- 
lems that crop up with image-analysis tools 
can make use of an online resource called 
the Scientific Community Image Forum 
(https://forum.image.sc). Also becoming avail- 
able are repositories that allow researchers 
to find and repurpose deep-learning models 
for related tasks — a process called transfer 
learning. One example is Kipoi, which allows 
researchers to search and explore more than 
2,000 ready-to-use models trained for tasks 
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such as predicting how proteins known as 
transcription factors will bind to DNA, or 
where enzymes are likely to splice the genetic 
code. 

Working with other tool developers, 
Lundberg’s team put together a rudimentary 
‘model zoo’ (https://bioimage.io) to quickly 
share its Human Protein Atlas models, and is 
now creating amore sophisticated repository 
that will be useful to model producers and 
non-expert users alike. 

A platform called ImJoy will be part of 
this effort, Lundberg says. Created by Wei 
Ouyang, a postdocin her lab, the platform lets 
researchers test and run Al models through a 
web browser on their computer, inthe cloud or 
onaphone. Sharing bioimaging data sets and 
deep-learning models will also be a priority 
for the Center for Open Bioimage Analysis, an 
effort funded by the US government and led by 
Carpenter and Kevin Eliceiri, a bioengineer at 
the University of Wisconsin-Madison. 

Another option, ZeroCostDL4Mic, 
launched last month. Developed by bio- 
physicist Ricardo Henriques at University 
College London, ZeroCostDL4Mic makes 
use of Colab, Google’s free cloud service for 
Al developers, to provide access to several 
popular deep-learning microscopy tools, 
including the oneJacquemet uses to automate 
cell-nuclei labelling in his films. “Everything 
you need is installed within a couple of min- 
utes,” Jacquemet explains. With a few mouse 
clicks, users can use example data to traina 
neural network to complete the desired task 
(see ‘Wanted: more data’), then apply that net- 
work to their own data — all without needing 
to code. 

Researchers who want to use larger datasets 
or train more-complex models might need 
to purchase or access extra computational 
resources beyond Google’s free service. 

By easing the way for biologists with scant 
know-how and resources to use deep learning, 
Henriques says, ZeroCostDL4Mic acts like “a 
gateway drug” for Al, luring researchers to 
explore the software underlying these tools 
that will continue to transform research in 
cancer and beyond. 


Esther Landhuis is a science journalist based 
near San Francisco, California. 
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akapo are probably the weirdest 
birds in the world: they’re large, 
flightless, nocturnal parrots. 
Found only in New Zealand, they 
are critically endangered and now 
live only on four predator-free sanctuary 
islands. Thanks to intensive management 

of their populations and breeding, we now 
have 201 birds in our sanctuaries — up froma 
low of 51in 1995. 

As the conservation biologist on the 
kakapo team, I work mainly in our office in 
Invercargill on South Island, New Zealand, 
60 kilometres from one of our sanctuary 
islands. | communicate with international 
experts about disease, inbreeding and 
other threats, and about solutions. Last year 
we had a huge outbreak of aspergillosis, a 
fungal disease, that made 21 birds ill and 
killed 9. 

Kakapo are unusual in that they manage 
on very low levels of vitamin D. We're trying 
to understand this, which might help us to 
improve supplemental feeding. The birds 
breed every few years when the rimu tree 
fruits. The berries have very high levels of 
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vitamin D, which the birds might need to lay 
eggs and raise chicks. 

During the breeding season, I'll spend 
months on the sanctuary islands off 
South Island and near Auckland in the 
north. We use artificial insemination to 
boost fertility and avoid inbreeding. We 
fit every bird with a tracker and an activity 
monitor, which they wear like a backpack. 
We use drones to send fresh semen around 
the island to make the best matches for 
receptive females. 

We're now on near-total lockdown in 
New Zealand, so we can’t monitor the 
kakapo in person. We'll continue to use our 
remote-monitoring systems to keep as close 
an eye on them as we can. 

It’s rewarding to work with this amazing 
species that most people never get to 
see. Sinbad, my favourite (pictured), is 
inquisitive around people. People fall in love 
with kakapo when they know them. 


a 


Andrew Digby is Science Adviser Kakapo/ 
Takahe for the New Zealand Department of 
Conservation. Interview by Amber Dance. 
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ome analysts are cynical about the apparent mismatch between 

spending and outcomes in what’s disparagingly called ‘the 

cancer industry’. How can cancer be the second leading cause of 

death globally, responsible for an estimated 9.6 million deaths 

in 2018, when asingle institution (the US National Institutes of 
Health, NIH) spent US$24.4 billion on cancer research in the past four 
years, not to mention outlays by so many other funders? 

As the graph on page S7 shows, researchers are nudging the dial on 
some types of cancer more than others. This Nature Index supplement 
focuses on three — cervical, prostate and melanoma — asalens through 
which to view the kinds of preventions and treatments that are length- 
ening survival rates, at least in high-income countries. 

Dimensions data provide interesting comparisons on value for 
money. Asarough indication, looking at the top ten funders’ total grants 
for cancer research from 2010 to 2019 beside their cancer research 
publications over the same period, the average for the National Natural 
Science Foundation of China is US$21,902 per publication. By contrast, 
for the US National Cancer Institute, part of the NIH and the world’s 
biggest funder of cancer research, it is US$129,624 per article. 

The above analysis is blind to article quality. For that, the indica- 
tor is publication in the 82 high-quality journals selected by experts 
for inclusion in the Nature Index, which, it should be noted, does not 
include clinical sciences journals. In cancer, as in every other field, 
China’s rise is striking. Its cancer research in the Nature Index rose by an 
estimated 114.9% from 2015 to 2019, according to our key metric, Share, 
afractional count of the proportion of the country’s affiliated authors 
oneach article. The number of cancer research articles published in 
Nature Index journals, identified through a search using Dimensions, 
grew by 25.8% over the same period, more than four times the growth 
for articles overall. One reason that cancer outcomes seem not to be 
improving inline with research output is that improved treatments have 
not been accessible to all, as our stories about cervical and prostate 
cancer explain (pages S2 and S5). Therapies, let alone the latest treat- 
ments, may be out of reach in the low- and middle-income countries 
where 70% of global cancer deaths occur. That’s not a problem science 
alone can solve. 


Catherine Armitage 
Chief editor 
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Akello Faith receives her HPV vaccine at a mobile health clinic in Ochaga, Uganda. 
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, Aglobal drive 


towards elimination 


Vaccination is picking up where screening left off in 
reducing cervical cancer rates. By Sarah DeWeerdt 


ervical cancer is already a prevention 
success story. The widespread use 
of the Pap smear, a screening test to 
identify pre-cancerous and cancerous 
cells collected from the cervix, has led 
tosteep declines in diagnoses and deaths from 
cervical cancer over the past half-century, 
especially in high-income countries. Cervical 
cancer is still one of the most common cancers 
in women, so the World Health Organization 
(WHO)has an ambitious goal: to eliminate it asa 
public health problem. Its World Health Assem- 
bly plansto vote onastrategy to put all countries 
onapathtocervical cancer elimination by 2030. 

Nearly all cases of cervical cancer are caused 
by the human papilloma virus (HPV), which is 
spread through sexual contact. Most sexually 
active adults have been exposed, although only 
some strains cause cervical cancer. 

Since 2006, the HPV vaccine has achieved 
further stunning results in cervical cancer 
prevention. For example, in England, where a 
national vaccination programme for adolescent 
girls began in 2008 and expanded to include 
boys last year, public health officials revealed 
in January 2020 that less than 2% of sexually 
active young women tested in 2014 to 2018 
carried HPV strains 16 and 18. Those strains sig- 
nificantly increase the risk of cervical cancer 
and are responsible for 70% of cases. The results 
are “very impressive’, says Lois Ramondetta, a 
gynaecologic oncologist at the MD Anderson 
Cancer Center in Houston, Texas, which is 
ranked fourth in the Nature Index for cancer 
research output. “It’s more evidence for how 
effective the vaccine is, as well as for the impor- 
tant concept of herdimmunity.’ Herd immunity 
is the idea that high vaccination rates can vir- 
tually eliminate the virus at a population level. 

In cancer prevention and women’s health 
circles, the news raises hopes that the WHO’s 
goalis insight and cervical cancer could soon 
become vanishingly rare. 


Anewscreening method 

To epidemiologists and public health officials, 
‘eliminate’ has a specific definition: fewer than 
4 new cases per 100,000 women per year. The 
current global rate stands at 13.1 per 100,000; 
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more than 80% of diagnoses and deaths occur 
inlow- and middle-income countries. Reaching 
that benchmark depends on both vaccinating 
adolescent girls and screening adult women. 
But as far as screening goes, the Pap tests that 
have been key to reducing the burden of cervical 
cancer may playa relatively small role going for- 
ward. Testing cervical samples for the presence 
of DNA from high-risk HPV strains, rather than 
for abnormal cells, asina Pap smear, is now seen 
as amore effective way to identify women at risk 
of developing cervical cancer, partly because 
it may identify women at risk earlier thana Pap 
smear can, and is more objective. HPV testing 
replaced the Pap smear in Australiain December 
2017 and the US Preventive Services Task Force 
recommended the test, alone or incombination 
with Pap smears, in 2018. Many women in low- 
and middle-income countries lack access to any 
form of cervical cancer screening, but public 
health researchers are investigating howto roll 
out HPV testing programmes in these settings. 


Success story 


HPV testing of adult womenis crucial to reduc- 
ing cervical cancer rates inthe short term, says 
Karen Canfell, director of research at the Can- 
cer Council of New South Wales in Australia, 
because cervical cancer typically develops 
15 to 20 years after HPV exposure in healthy 
women. In England, 96% of patients diagnosed 
with early stage cervical cancer are still alive 
one year later, compared with only 50% of those 
witha late-stage diagnosis. 

But inits early stages cervical cancer’s symp- 
toms are often non-specific, such as unusual 
vaginal bleeding, or absent altogether, making 
screening tests all the more important. Vaccina- 
tion, meanwhile, is key to driving down cancer 
rates over the longer term. 

Rapid scale-up of both vaccination and HPV 
screening could prevent up to 13.4 million 
cases of cervical cancer worldwide in the next 
50 years, and enable all countries to reduce 
incidence below 4 per 100,000 by the end of 
the century, according to a global modelling 
study led by Canfell. 

Australiacould bethe first to reach the WHO’s 
benchmark for cervical cancer elimination. Its 
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nationwide, publicly funded, school-based HPV 
vaccination programme was introduced for 
girls in 2007, and expanded to include boys in 
2013. The country’s universal health-care sys- 
tem provides access to cervical cancer screen- 
ing at no or lowcost to most women. 

Ifhighrates of both vaccinationand screening 
continue, Australia could eliminate cervical can- 
ceras early as 2028, according to calculations by 
Canfell and her colleagues. 

But reaching the benchmark of elimination 
in Australia and elsewhere does not mean ineq- 
uities will be removed, she cautions. Cervical 
cancer rates are twice as high, and deaths three 
times as frequent, among Aboriginal womenin 
Australia compared to non-Aboriginal women. 
According to the US Centers for Disease Con- 
trol and Prevention, cervical cancer rates are 
higher in some populations, including black 
and Hispanic women, as well as women livingin 
the south, than in the United States as a whole. 


Strategies for better coverage 
Reaching the WHO’s goal will also be challeng- 
inginlow- and middle-income countries such 
as India and Kenya, where both cost and poor 
health-care infrastructure pose a barrier. Cer- 
vical cancer rates are relatively high in many of 
these places because Pap testing has not really 
filtered down, says Surendranath Shastri from 
the MD Anderson Cancer Center. 

Shastrihas worked onstudies of several alter- 
native approaches to cervical cancer screening 
in India, where laboratory facilities and trained 
pathologists who can interpret Pap smears 
are scarce outside large cities. He and his col- 
leagues have shown that a test knownas VIA, 
in which the cervix is coated with acetic acid 
and then inspected for white areas that may 
signify cancer or precancerous lesions, canbe 
performed by community health workers and 
reduces cervical cancer mortality. 

Self-sampling, in which women swab their 
own cervical tissue for HPV testing, could help 
toincrease access to cervical cancer screening in 
areas where women’s health specialists and facil- 
ities for pelvicexams are inadequate. “Being able 
to offer womenaself-collected test has opened 
up alot more options for when and where they 
canbescreened,” says Megan Huchko, a gynae- 
cologist and director of the Center for Global 
Reproductive Health at Duke University in Dur- 
ham, North Carolina, ranked 28thin the Nature 
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cancer vaccination 
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Index for cancer research. For example, Huchko 
and her colleagues working in East Africa have 
had success in offering the screening to women 
at community health fairs and door-to-door vis- 
its by health-care workers; women often receive 
their results by text message. These strategies 
can increase screening coverage from 5% to as 
muchas 70%, they have found. However, the low- 
cost HPV tests that are currently available don’t 
pinpoint the highest risk strains of the virus, so 
some women may be receiving unnecessary 
treatment for medium-risk strains, Huchko says. 

One hundred countries have added the HPV 
vaccination to their recommended vaccine 
schedules since 2006, but most of these are 
high-income countries, and altogether they 
cover only 30% of the global population of girls 
who need the vaccine. This number is slowly 
climbing: both Kenya and Uzbekistan launched 
HPV vaccination programmes in late 2019. 

In high-income countries, the HPV vaccine 
can cost as much as US$100 per dose (at least 
two doses are needed for full protection). Gavi, 
aglobal public-private partnership to increase 
vaccine accessibility, has helped 27 countries 
access the vaccine for as little as $4.50 per dose. 
The advent of HPV vaccines produced more 
cheaply in middle-income countries such as 
China and India and shipped around the world 
will also reduce costs — the first such vaccine 
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The US Food and 
Drug Administration 
approves vaccine 
against 9 HPV strains. 
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The WHO updates 
guidelines on cervical 
cancer prevention and 
control. 
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was approved in China at the end of 2019. But, 
Shastri cautions, “It’s not just about providing 
the vaccines. There are a whole lot of logistics 
that go into the vaccine delivery.” 

Some high-income countries have also strug- 
gled to attainthe cervical cancer screening and 
HPV vaccination rates that willbe necessary to 
eliminate the disease. In the United States, a 
fragmented health-care system limits access 
to screening for many women. And only about 
half ofadolescents aged 13 to17 were up to date 
on their HPV vaccination in 2018. 

Vaccine hesitancy is common in the United 
States, andit hasn’t helped that the HPV vaccine 
is associated with the issue of young women’s 
sexuality. The vaccine was originally advertised 
as ananti-STD for girls rather than an anti-can- 
cer vaccine, Ramondetta says. That led to reluc- 
tance by government officials to mandate the 
vaccine, especially in conservative states, out 
of fear that it would promote promiscuity. In 
addition, gynaecologists were initially respon- 
sible for delivering the vaccine, when most girls 
of the appropriate age would still be cared for 
by paediatricians. Still, Ramondetta says, vac- 
cination rates in the United States “continue 
to go up every year, which is pretty amazing”. 


Sarah DeWeerdt is a science writer in Seattle, 
Washington. 
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The WHO calls for 
elimination of cervical 
cancer as a global 
public health problem. 


2019 

Kenya and Uzbekistan 
launch HPV vaccination 
programmes. 
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Urology fellow, Jeremy Fallot, and nurse, Shauna Harnedy, assist in robotic surgery by Ruban Thanigasalam (out of view) in Sydney, Australia. 


Acloser look 


at arevered robot 


The da Vinci robotic system has become the ubiquitous method 
for prostate removal, but its high cost is raising questions. By Bec Crew 


oved by surgeons and patients alike for 

its ease of use and faster recovery times, 

the da Vinci surgical robot is less inva- 

sive than conventional procedures, and 

lacks the awkwardness of laparoscopic 
(keyhole) surgery. But the robot’s US$2-mil- 
lion price tag and negligible effect on cancer 
outcomes is sparking concern that it’s crowd- 
ing out more affordable treatments. 


There are more than 5,500 da Vinci robots 
globally, manufactured by California-based 
tech giant, Intuitive. The system is usedina 
range of surgical procedures, but its biggest 
impact has been in urology, where it has a 
market monopoly on robot-assisted radi- 
cal prostatectomies (RARP), the removal 
of the prostate and surrounding tissues to 
treat localized cancer. Uptake in the United 
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States, Europe, Australia, China and Japan for 
performing this procedure has been rapid. In 
2003, less than 1% of surgeons in the US per- 
formed a RARP in preference to open or lap- 
aroscopic surgery. By 2014, RARP accounted 
for up to 90% of radical prostatectomies 
across the country. When it comes to pros- 
tate cancer surgery in the United States, says 
Benjamin Davies, surgeon and professor of 
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urology at the University of Pittsburgh, “the 
die is cast; there is only robotic surgery”. 

After lung cancer, prostate cancer is the sec- 
ond most common cancer in men worldwide. It 
affects the walnut-sized prostate gland, which 
sits up against the urethra, between the rec- 
tum and bladder, and secretes prostate fluid, 
acomponent of semen. The prostate’s proxim- 
ity to the blood vessels, muscles and a fragile 
web of nerve bundles that control erectile and 
bladder function, demands extreme surgical 
precision in its removal, a procedure that is 
generally recommended if the disease has not 
yet spread. Whereas an open patient needs 
to be cut from naval to pubic bone in order to 
access the prostate, a robot-assisted proce- 
dure requires a few small abdominal incisions. 

Known as a master-slave system, the da 
Vinci comprises three main components. The 
tower (or ‘slave’) wields three arms equipped 
with instruments such as forceps, hooks and 
needle-drivers, and a fourth holds cameras 
capable of 15 times magnification. The con- 
sole (‘master’) is where the surgeon sits, afew 
metres from the patient, remotely operating 
the robot arms while watching through a 3D 
stereoscopic monitor. A separate cart contains 
image-processing equipment. 

Surgeons prefer to use the da Vinci robot 
because it offers improved visualization and 
hand and wrist flexibility, and they can be 
seated throughout the 2- to 4-hour proce- 
dure. “We can see the anatomy of the prostate 
like we have never seen it before,” says Fred- 
die Hamdy, Nuffield professor of surgery and 
urology at the University of Oxford, UK, which 
is ranked 26th in the Nature Index for cancer 
research output. 


Cancer outcomes equal 

Whether these improvements translate to 
better long-term outcomes for the patient, 
however, remains unclear. Ruban Thaniga- 
salam, associate professor of robotic surgery 
at the University of Sydney and clinical lead 


Success of robotic surgery lies solely in the 
skill of a surgeon, says Ruban Thanigasalam. 


in prostate cancer research at the Institute of 
Academic Surgery in Australia, is conducting 
atrial comparing open and robotic surgery. 
The preliminary results support what has been 
widely accepted by surgeons for years: robot- 
ic-surgery patients experience reduced blood 
loss, less pain and shorter recovery time, but 
the longer-term outcomes are equivalent. 
“Anecdotally, we find that recovery of con- 
tinence is earlier in the robotic group, but 
after 12 months, there is no major difference 
between the two for urinary control and sexual 
function,” says Thanigasalam. For the cancer 
itself, he adds, the outcomes are the same. 
“Several international studies looking at 
tens of thousands of patients have all shown 
that there is absolutely no difference in cancer 
outcomes between robotic and open surgery.” 


Thanigasalam stresses that the outcomes 
of robotic surgery remain dependent on the 
surgeon’s skills, asentiment echoed by Davies: 
“It’s always the surgeon’s hands, not the tech- 
nology we use.” 

Even da Vinci’s proponents acknowledge 
the temptation to overplay its ability. “We 
all love a good robot,” says Richard Sullivan, 
professor of cancer and global health at King’s 
College London and director of the Institute of 
Cancer Policy inthe United Kingdom. “Human 
beings, particularly surgeons, are incredibly 
neophilic. We love this sort of thing, it gives 
us authority. And the patient will think that, 
because you've got all of this fancy kit, you 
must have better outcomes. But that’s not 
true, the robot is not an indicator of quality.” 


Accessibility gap 

According to a 2017 report by the Royal Aus- 
tralasian College of Surgeons and Australian 
health insurance provider, Medibank, the cost 
of a prostate cancer procedure varied nation- 
wide from Aus$14,553 to Aus$55,928 (US$9,165 
to US$35,222). The use of robotics, the report 
states, “can substantially increase the cost”. 

Despite questions over value for money, 
business is booming. In 2018, the global sur- 
gical robots market was worth US$6.8 billion, 
and it’s predicted to hit $17 billion by 2025. In 
response to the surge in robotic surgery, the 
US Food and Drug Administration (FDA) urged 
patients and health-care providers to exercise 
caution last year, particularly with regards to 
breast and cervical cancer, citing a lack of 
long-term evidence. “The problem is, once it 
becomes adopted, it can be very difficult to 
pedal it back,” says Hamdy. 

A “massive inequality gap” is opening 
between hospitals that can afford the robot, 
and those that can’t, says Sullivan. “In many 
countries, we're fighting for patients because 
of choice and competition. If I’ve got a robot, | 
can sell that fact to patients, and they'll come 
to me rather than the centre down the road.” 
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University performs 
the first nerve-sparing 
radical prostatectomy, 
making it possible 

to preserve sexual 
function and urinary 
continence in some 
patients. 
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Intuitive, da Vinci's 
manufacturer, is 
founded by surgeon 
Frederic Moll, engineer 
Robert Younge, and 
venture capitalist John 
Freund. 


The first commercial 
sale of a da Vinci 
robotic system is made 
to the Leipzig Heart 
Center in Germany. 
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The da Vinci is the 
first robotic system to 
gain FDA approval for 
general laparoscopic 
surgery. 


An account of the first 
robotically assisted 
radical prostatectomy, 
which was performed 
using a da Vinci 
system, is published in 
the BJU International by 
J. Binder and W. Kramer 
at the Johann Wolfgang 
Goethe University in 
Germany (Binder, J. & 
Kramer, W. BJU Int. 87, 
408-410; 2001). 
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World Health Organization found that com- | Changes in average five-year survival rates in common cancer types in the United States since the early 1970s. 
tee . . . The five-year survival rate is the percentage of people who live longer than five years after diagnosis. The cost of 

petition between hospitals with and without cancer diagnosis and treatment globally rose 27% to US$107 billion between 2010 and 2015, according to the 

surgical robots contributed to the closure of | jms institute for Healthcare Informatics. It is projected to rise again by 40% to $150 billion in 2020 in constant 


25% of radical prostatectomy centres inthe | dollar terms. 
English National Health Service. This focus on 
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A. Aggarwal in Reducing Social Inequalities in 
Cancer: Evidence and Priorities for Research, 
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IARC Monograph, 2019). ; ; 
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like Zambia or India, I’m screwed if I’ve only 
done minimally invasive or robotics. Outside 
the high-income settings, these services aren't 
available.” 

Competition could drive the price of the 
da Vinci robot down, such as from UK-based 
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Improved screening could see fewer men | Deaths from cancers globally in 2017, when there were 9.56 million cancer deaths overall. Lung cancer and 
P : - breast cancer are the two most common types but lung cancer was the biggest cause of cancer death that year 
undergoing surgery in the first place. There is while breast cancer was 5th. Colorectal cancer is the 3rd most common type and was the 2nd biggest cause of 
evidence that the benefits of the prostate-spe- | cancer deaths in 2017. 


cific antigen (PSA) blood test, which, along 
with a digital rectal examination, is the most 
common way to screen for prostate cancer, Tracheal, bronchus 
may not outweigh the potential harm of mis- and lung cancer 
diagnosis leading to unnecessary surgery or 
radiation. Researchers from the Queen Mary 
University of London and University of East 
Anglia, UK, are developing blood and urine 
tests to be used in conjunction with the PSA. 
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surgeries in the United grows 66% from 
State are robot-assisted | US$312 in 2017 to 
operations. $520 in 2019. Its total 
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Melanoma, the subject 
of the article on $14, 
caused 65,000 deaths in 
2017, ranking 24th for 
mortality by cancer type. 
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Cellular roadblocks trip 
melanoma hijackers 


Success for a class of immunotherapy drugs is 
changing the face of treatment. By Bianca Nogrady 


henJedd Wolchok began working 
in the area of melanoma 20 years 
ago, the average life expectancy 
forapatient with advanced disease 
was six or seven months. 

Now his waiting room is full of people 
coming back for their third or fourth year of 
follow-up, sharing their stories of survival with 
the newly diagnosed, giving hope wherejusta 
decade ago there was little. 

“That gives you a sense of the human impact 
of this,” says Wolchok, a medical oncologist 
and director of the Parker Institute for Cancer 
Immunotherapy at the Memorial Sloan Ketter- 
ing Cancer Center in New York, ranked fifthin 
the Nature Index for cancer research output. 


Transformative treatment 


Behind this transformation in melanoma sur- 
vival rates is a class of drugs called checkpoint 
inhibitors, the first of which was approved nine 
years ago. Checkpoint inhibitors are a form of 
cancer immunotherapy — treatments that 
stimulate the immune response to cancer cells. 
Checkpoint inhibitors are not the first form of 
cancer immunotherapy, but they are, so far, 
among the most successful, particularly in 
melanoma. They’re also having a big impact 
inlungand urinary tract cancers. “Melanomais 
the most sensitive type of cancer to checkpoint 
inhibitors,” says James Larkin, medical oncolo- 
gist at the Royal Marsden Hospital in London. 


But no oneis sure why. Some patients respond 
wellto checkpoint inhibitors, but others don’t 
respondatall, for reasons that are also not yet 
understood. 

Checkpoint inhibitors work by preventing 
tumour cells from hijacking, and therefore 
avoiding, the cellular immune response that 
should eliminate them. Their discovery came 
about in the late 1990s, when two groups of 
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researchers from the United States andJapan 
uncovered a series of interactions between 
cell-surface receptors and proteins that led 
to the death of immune T cells. 

Tcells are the cells that would normally lead 
the charge against cancer and other threats. 
They have a receptor on their surface called 
PD-1(programmed cell death protein1). When 
that receptor is engaged, it triggers the T cell 
to rupture — one of the many checkpoints that 
have evolved to help keep the immune system 
from over-reacting. 

The protein that engages that receptor is 
PD-L1 (PD ligand 1). It turns out that many 
human cancers also produce PD-L1, the fac- 
tor that tumours are using to hijack the check- 
point and engage the T-cell death receptor to 
stop the response against them. 

Scientists showed that inhibiting this 
tumour-hijacked checkpoint could unleash 
an immune response against the tumour. 


Asense of possibility 


The first checkpoint inhibitor drug, ipili- 
mumab, was approved by the US Food and 
Drug Administration in March 2011 for the 
treatment of melanomathat had spread or that 
could not be treated surgically. Compared with 
amelanoma vaccine, itself a new therapeutic 
approach being trialled, the drug significantly 
improved survival rates. Although it worked in 
only around one in five patients, the benefits 
in those patients were dramatic, Larkin says. 
“We really had asense then of the possibilities.” 

Ipilimumab was followed by pembroli- 
zumab in September 2014, and nivolumab 
just three months later. All of these, and newer 
checkpoint inhibitors, are now in widespread 
use, although they’re expensive for patients, 
particularly in countries without public 
health insurance schemes. A course of intra- 
venous checkpoint inhibitor therapy can cost 
US$150,000-250,000 per year. 

The most spectacular results so far with 
checkpoint inhibitor therapy have come from 
trials combining two different checkpoint 
inhibitors, such as ipilimumab and nivolumab. 
Larkin and Wolchok were both involved in 
the CheckMate 067 study, which began in 
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July 2013 and compared ipilimumab alone 
with nivolumab alone, and with ipilimumab 
plus nivolumab in 945 people with advanced 
untreated melanoma. 

“It was a blinded trial, so you didn’t know 
which treatment the patients were getting,” 
Larkin says. “Andit was really striking that some 
patients who had symptoms or were quite sick 
were improving really, really quickly, which 
we'd never seen before.” 

The combination was so successful that a 
paper published in the New England Journal 


The treatment of many melanomas is now starting with immune checkpoint inhibitors, rather than major surgery. 


of Medicine in late 2019 showed that 52% of 
patients were alive after five years, compared 
with 44% of patients on nivolumab alone and 
26% of patients on ipilimumab alone (J. Larkin 
etal. N. Engl. J. Med. 381, 1535-1546; 2019). As 
often with clinical trials, checkpoint inhibitors 
were first tested in the most severely affected 
patients, those whose cancer was untreatable 
with surgery or which had spread despite exist- 
ing treatments. But with each new trial show- 
ing unprecedented survival rates, questions 
would arise as to whether these drugs should 


be used earlier in the disease, even before it 
had spread. 

Grant McArthur, a medical oncologist and 
head of the molecular oncology laboratory 
at the Peter MacCallum Cancer Centre in Mel- 
bourne, Australia, says checkpoint inhibitors 
have brought a paradigm shift in the manage- 
ment of melanoma. “We see patients, who 
previously would have had large, complex 
surgical procedures that are associated with 
substantial morbidity, who now will start with 
the immune checkpoint inhibitors,” he says. 
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“The idea that immunotherapy could replace 
surgery is being entertained for the first time.” 

It’s notall good news. Checkpoint inhibitors 
comewithsome potentially serious sideeffects, 
many as a result of an over-active immune 
response, which is linked to inflammation in 
the bowel, lung, heart, skin and other organs. 
Andaround half of the patients with advanced 
disease don’trespondas spectacularly, or atall, 
to checkpoint inhibitors. 

Some survive longer than they might have 
done without treatment, or have a longer 
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period until their disease progresses. However, SELECTED TOP FUNDERS OF CANCER RESEARCH 
the CheckMate 067 study found that 48% of — Although the US National Cancer Institute (NCI) is by far the world’s biggest funder of cancer research, China's 
patients nasicledlwlcnin tive vents: despite: eT emirate 
treatment with a combination of checkpoint 
inhibitors. There’s palpable frustration over 
why no one can explain this. It’s an active area of 
research, and there are early suggestions about 
what might be the deciding factors. One clue is 
that people who seem to get the most benefit 
from checkpoint inhibitors are those whose 
immune systems arealready puttingupafight Medical Research Council (UK) 
when they start treatment, says Wolchok. National Natural Science 
“The best evidence for that comes from Foundation of China 
pathology studies, which have shown that 


Grant funding, 2010-19 


National Cancer Institute (US) 


European Commission 


tumours that already have T cells inthem are ° : 1 - = = 
the ones where you see responses,” he says. US$ (billions) 

“What the checkpoint inhibitors are doing 

in general is allowing a pre-existing immune Average grant amount (number of grants in brackets) 


response to become more effective.” 


There’s also evidence that patients with “ational Cancer Institute (US) pgs! 
cancers caused byacertain genetic condition EurbpesnGermiasion [2,292] 
called mismatch repair deficiency may actu- 
ally respond better to checkpoint inhibitors, — Medical Research Council (UK) [1,814] 
regardless of their cancer type. agonal Natal Sciences 
: : [33,728] 

Foundation of China 
Into the unknown 
Another feature that seems to be linked to : oe 10 ss a 
better response rates is what’s called the US$ (millions) 


mutation burden of the tumour, the number 


of genetic mutations present in the genome RESEARCH BY SECTOR 


of an individual’s cancer Just as exposure to Sectoral contributions to cancer articles over the period according to their Nature Index Share, a fractional 
‘ ‘ 7 count that takes into account the proportion of authors from that sector on any given article. Total Share for 
cigarette smoke causes the mutationsthatare cancer articles across all sectors was 20,208; the figure for total Share by sector is larger because some 


common to lung cancers, exposure to ultra- _ institutions belong to multiple sectors. 

violet radiation causes a set of mutations that 

are common features of skin cancer. But indi- Non-profit organization: 2,000.61 
viduals with skin cancer that grows in parts Academic: 14,920.83 Health care: 5,008.06 
of the body that are less exposed to the sun 
may have a lower mutation burden, and that 
seems to make them less likely to respond to 
checkpoint inhibitors. LIVING AND DYING 


“The hypothesis is that cancersthat havea incidence and mortality rates in different regions. Accuracy may vary owing to limited data quality and coverage, 
lot of mutations have many abnormal-appear- particularly in low- and middle-income countries. According to the World Health Organization, approximately 


ing proteins, which makes them look different 70% of deaths from cancer occur in those countries. 
fromthe normal cell that they came from,” says 
Wolchok. “That is something that the immune 
system at baseline is able to survey for.” 
Given the survival rates among people who 
dorespond to checkpoint inhibitors, is it time 


Government: 1,405.30 
Corporate: 621.46 


Australia and New Zealand 


North America 


to start talking about a cure for melanoma? Europe 
Oncologists are wary of the word, preferring 
to talk about long-term survivorship, which is South America 


itself a novel concept in melanoma. 

“If you’ve no longer got a disease that 20 
years ago had a survival of six to nine months, Southeast Asia 
and it turns out that you’re a long-term sur- 


World 


Africa (excluding southern 


SOURCE: DIMENSIONS FROM DIGITAL SCIENCE ; GLOBOCAN 2018/WHO; WHERE CREDITED AS SUCH, DATA USED IN THIS SUPPLEMENT WERE SOURCED FROM DIMENSIONS, AN INTER-LINKED RESEARCH INFORMATION SYSTEM PROVIDED BY DIGITAL SCIENCE (HT TPS://WWW.DIMENSIONS.AI) 


vivor, what does that look like? Larkin asks. Africa) Hl incidence 

“Curing metastatic solid tumours isn’t some- ; = ; 

thing that we’ve ever really faced before.” South-central Asia Mortality 

Bianca Nogrady is a science writer in the 500 400 300 200 100 fe) 100 200 
Blue Mountains, Australia. Age standardized rate per 100,000 people 
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Coming at cancer 
from allangles 


The search for disease mechanisms and treatments 
is one of the biggest collaborative efforts in science. 
These researchers are significant contributors. 


4a t’simpossible to do cancer research 

on your ownif you want to dosome- 

thing that makes sense for the dis- 

ease,” says lung cancer researcher, 

Niki Karachaliou, referring to the 

diverse teams of physicians, clinicians and 

other researchers whose different perspec- 

tives help create a shared understanding of 
this complex disease. 

Karachaliou and two other researchers 
whose collaboration networks are shown here 
were selected for the strength of their publi- 
cation count in Dimensions. They were drawn 
from an elite group of researchers in cancer, 
who were authors in Nature Index between 
2015 and 2019 and whose first authorship onan 
article in Dimensions dates between 2010 and 
2014. Authors from the United States dominate 
the group, as might be expected, giventhe US 
leads the field. Although the closest connec- 
tions for two of the collaboration networks 
shown here are domestic, the third reflects 
activity between the United States and China, 
the top two countries for cancer research. 

Among the top five countries for cancer 
research in the index, the United States and 
China are the most self-sufficient, with inter- 
nationally collaborative articles on cancer 
comprising only 51.8 and 49.8%, respectively, of 
their total cancer articles. For the third-ranked 
country, the United Kingdom, 81.6% of cancer 
articles are internationally collaborative, and for 
Germany, fourth, 80.7% of cancer articles share 
authorship with researchers outside Germany. 


Number cruncher: A biostatistician 
leverages her maths expertise to 
improve cancer care 

Heidi Kosiorek 

Annualaverage publications count:30.2 


Predicting how various treatments could 


MELISSA VALLADARES FOR NATURE 


y Aad affect an individual cancer patient requires 
Biostatistician Heidi Kosiorek works at the crossroads of maths and medicine, sifting through more than a medical understanding of the 
data to weigh the best course of action based on cancer types and patient profiles. genetics and molecular mechanisms under- 


lying the disease. It involves sifting through 
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WELL-CONNECTED 


The graphs show each profiled researcher's top ten research collaborators, ranked by the number of papers they have co-authored, and their 
collaborators’ top ten co-authors. In this series of graphs, each node represents a researcher, with line widths sized by the strength of the 
collaboration. Lines connect the profiled researcher (square) to their co-authors (numbered nodes). Not all collaborators mentioned in the text will be 
reflected in these networks, depending on the number of resulting papers to date. 


reams of data to identify, by weight of num- 
bers, the tumour and patient characteristics 
that could influence success or failure. It’s 
about separating the trends from the flukes, 
the biomarkers from the outliers. 

This first occurred to Heidi Kosiorek in the 
mid-1990s, during an internship working asa 
research assistant in the emergency depart- 
ment ofan Ohio hospital. “I think it’s even more 
true today” at a time when personalized and 
precision medicine have become buzzwords, 
says Kosiorek, now a biostatistician at the 
Mayo Clinic’s Scottsdale, Arizona, campus. 

Kosiorek had planned to go to medical 
school, but with her eyes opened to the inter- 
section between mathematics, which had 
long been her strength, and medicine, she 
pursued biostatistics instead. That decision 
kicked offa prolific career. Since 2015, Kosiorek 
has authored or co-authored an average of 
30 publications per year. For more than a 
decade, she worked at the University Hospitals 
Case Medical Center in Cleveland on studies 
of ovarian, endometrial and cervical cancers. 
Since joining the Mayo Clinic in late 2014, she’s 
developed expertise in breast cancer, and often 
teams up with Mayo Clinic colleagues, Bar- 
bara Pockaj, asurgeon, and Donald Northfelt, 


Profiled researcher 

Top collaborator 

Other collaborators 

° More co-authored papers 

e Fewer co-authored papers 


amedical oncologist, toinvestigate topics such 
as tumour genetics and detecting recurrence. 

The Mayo Clinic receives nearly US$120 mil- 
lion in annual grant funding for cancer 
research, keeping biostatisticians at its three 
major campuses busy. Choosing which pro- 
jects to work on “is achallenge, for sure”, says 
Kosiorek, “because I want to do it all”. 

As an assistant supervisor, Kosiorek helps 
assign roughly 20 statisticians and statistical 
programmers to projects and oversees their 
work. Her mentorship of many junior scien- 
tists partly explains her impressive publication 
rate. “You end up beinga part of more projects 
because of that,” she says. 

Kosiorek’s collaborations extend beyond 
the Mayo Clinic. She is the lead biostatisti- 
cian for the Myeloproliferative Neoplasms 
Research Consortium, a US National Cancer 
Institute-funded initiative to improve treat- 
ment for a rare chronic blood cancer that 
sometimes develops into leukaemia. She 
works closely with researchers from several 
institutions across the United States and Can- 
ada that are part of the consortium. 

A crucial part of individualized medicine 
is taking into account the patient’s prefer- 
ences regarding outcomes and quality-of-life 
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TOP COLLABORATORS FOR 
HEIDI KOSIOREK 


1. Amylou Dueck 
Mayo Clinic, United States 


2. Ruben Mesa 
The University of Texas MD Anderson 
Cancer Center, United States 


3. Jeanne Palmer 
Mayo Clinic, United States 


4. Donald Northfelt 
Mayo Clinic, United States 


5. Barbara Pockaj 
Mayo Clinic, United States 


6. Curtiss Cook 
Mayo Clinic, United States 


7. Karen Anderson 
Arizona State University, United States 


8. Robyn Scherber 
The University of Texas MD Anderson 
Cancer Center, United States 


9. Nina Karlin 
Mayo Clinic, United States 


10. Patricia Verona 
Mayo Clinic, United States 


impacts. Withthatin mind, Kosiorek and Pockaj 
are running a study on follow-up surgeries 
after breast reconstruction. They want to cate- 
gorize the kinds of procedures that are needed 
for cosmetic considerations, for example, and 
how often, so that breast-cancer patients can 
make more informed decisions when consid- 
ering reconstruction after a mastectomy. 
“What drives me is helping physicians find 
what’s best for their patients through the data,” 
Kosiorek says. “There are many days where it 
doesn’t really feel like work.” Sarah DeWeerdt 


Biomedical naturalist: A biomedical 
engineer harnesses an organic drug 
delivery system — the body’s own cells 
Chao Wang 

Annual average publications count: 13.4 


Immunotherapy, which harnesses the immune 
system to attack tumour cells, is one of the hot- 
test fields in cancer research. One approach 
involves attaching immune-boosting drugs to 
nanoparticles made of gold or iron oxide and 
injecting them into the patient. 

In his pursuit of new cancer-fighting drugs, 
biomedical engineer Chao Wang eschews such 
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TOP COLLABORATORS FOR 
CHAO WANG 


1. Zhuang Liu 
Soochow University, China 


2. Liang Cheng 
Soochow University, China 


3. Chao Liang 
Soochow University, China 


4. Yonggang Li 
The First Affiliated Hospital of Soochow 
University, China 


5. Zhen Gu 
University of California, Los Angeles, 
United States 


6. Quanyin Hu 
Massachusetts Institute of Technology, 
United States 


7. Qian Chen 
Soochow University, China 


8. Wujin Sun 
University of California, 
Los Angeles, United States 


9. Jicheng Yu 
The University of North Carolina at 
Chapel Hill, United States 


10. Jinqiang Wang 
University of California, 
Los Angeles, United States 


synthetic materials and looks instead to more 
organic drug carriers — the body’s own cells. 
“Nature is the best engineer,” he says. 
Inaddition to being 
expensive to produce, 
metallic nanopar- 
ticles could poten- 
tially be toxic over 
the long term, says 
Wang. He’s investi- 
gating the humblered 
blood cell, the most 
abundant cell type in 
the human body, asa 
safer alternative. “Red 
blood cells may be the 
ideal carrier,” he says. 
In 2019, Wang and his team described how 
immune-stimulating molecules called anti- 
gens were administered to mice using their 
own red blood cells. The cells were extracted, 
modified with antigens, and then re-injected, 
where they were taken up by the spleen. As 
reported in Science Advances, the treatment 
spurred the immune systems of the mice, 
which helped to slow the tumour growth and 
increase survival rates (X. Han et al. Sci. Adv. 
5, eaaw6870; 2019). This was the first major 


Chao Wang. 
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result from Wang’s lab since its launch in 
2018 at Soochow University in Suzhou, China, 
roughly 100 kilometres west of Shanghai. 

In principle, different types of cells could be 
used to target the immune system in different 
parts of the body. For instance, fresh red blood 
cells, which are responsible for circulating 
oxygenin the body, could be used as transport 
for drugsthat target the lungs. “Wecan doalotof 
fancy things with these simple cells,” says Wang. 

Before he moved to Suzhou, Wang was a 
postdoctoral fellow inthe United States, where 
he worked with his adviser, Zhen Gu, then at 
the University of North Carolina at Chapel Hill, 
to assess the potential of platelets (tiny blood 
cells) to deliver a class of drugs called immune 
checkpoint inhibitors. 

Checkpoint inhibitors block the mech- 
anisms that usually keep the immune 
system incheck, allowing it to be unleashed on 
cancercells. Butthiscanhaveserioussideeffects 
if it also attacks healthy cells. Because plate- 
lets naturally migrate to sites of inflammation, 
they can carry checkpoint inhibitors toamore 
targeted site. Surgical wounds, for example, 
where atumour has been removed area good 
place to tackle any residual cancer cells. 

According to their 2017 Nature Biomedical 
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Engineering paper, Wang and Gu’s platelet 
therapy technique reduced cancer recur- 
rence in mice, allowing 75% to survive after 
60 days. No mice in the control group survived 
(C. Wang etal. Nature Biomed. Eng.1,0011; 2017). 

In North Carolina, Wang says he learnt the 
importance of working closely with clinicians 
and doctors, including bringing them into 
group meetings. It’s a philosophy he took back 
to Soochow, where he works with professors, 
doctors and students from the university’s 
medical school. “You cannot just do it in your 
lab by yourself” he says. “Youneed contact with 
the clinicians and the doctors to know which 
problems you want to address in real life.” 
Mark Zastrow 


Target hunter: Uncovering the 
mechanism that drives drug 
resistance in lung cancer 

Niki Karachaliou 

Annualaverage publications count:25.4 


Lung cancer kills more people every year than 
any other type of cancer worldwide, and smok- 
ingis its leading cause. But Niki Karachaliou’s 
research focuses on FGFR-positive lung cancer, 
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which is more common among non-smokers 
thansmokers, and caused by a mutation in the 
EGFR gene, which triggers rapid growth and 
division. 

Although FGFR-inhibiting drugs such 
as gefitinib (sold as Iressa) and erlotinib 
(known as Tarceva) block the effects of this 
particular mutation, new treatment-resist- 
ant mutations tend to emerge within a year 
in almost all patients, says Karachaliou. 
This allows disease progression, leaving 
doctors with little course of action. 

“There are still too many failures, with few 
patients responding welltotargeted therapies,” 
says Karachaliou, who moved from Spain to 
become medical director of the GCD (Global 
Clinical Development) Oncology division at 
Merck, in Darmstadt, Germany, in 2019. “It’s 
devastating telling patients that we can only 
keep them disease-free for a few months.” 

In 2019, Karachaliou’s team identified two 
potential drug targets in lung-cancer patients 
who have been treated unsuccessfully with 
EGFR-inhibitors. By analysing tumour sam- 
ples, they found that the disease progressed 
6-12 months earlier in patients with high levels 
of mutated enzymes called ILK and SHP2 (N. 
Karachaliou et al. EBioMedicine 39, 207-214; 
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2019). The findings are being used to develop 
treatments that complement EGFR therapies. 

Using liquid biopsies, Karachaliou has also 
uncovered newclasses of mutationsinthe BRAF 
gene, which produces a proteininvolved in cel- 
lular signalling. The liquid biopsy is anew test 
that captures changing tumour signatures in 
the blood, providing cancer researchers with 
a“more complete picture of what is happening 
at the time of progression’, says Karachaliou. 

Collaboration has driven Karachaliou’s 
career since she graduated as a medical stu- 
dent from the University of Athens in Greece. 
She has co-authored more than 200 papers, 
many of which were with her collaborator and 
mentor, Rafael Rosell, who leads the Dr Rosell 
Oncology Institute in Barcelona, Spain. Kara- 
chaliou began working with Rosell at Barce- 
lona’s Quir6n Dexeus University Hospital in 
2012, while she was completing her PhD. 

Between patient visits and lab work, Kara- 
chaliou worked with a diverse team of clini- 
cians, physicians and other researchers. “It was 
very interactive, which is important in oncol- 
ogy,” says Karachaliou. “It’s impossible to do 
cancer research on your own if you want to do 
something that makes sense for the disease.” 
Gemma Conroy 
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TOP COLLABORATORS FOR 
NIKI KARACHALIOU 


1. Rafael Rosell 
Institute for Health Science Research 
Germans Trias i Pujol, HUGTP ICS, Spain 


2. Santiago Viteri 
Hospital Universitario Quiron Dexeus, 
BMA, Spain 


3. Ana Gimenez Capitan 
Hospital Universitario Quiron Dexeus, 
BMA, Spain 

4. Jordi Bertran Alamillo 
Hospital Universitario Quiron Dexeus, 
BMA, Spain 

5. Miguel Angel Molina 
Hospital Universitario Quiron Dexeus, 
BMA, Spain 

6. Ana Drozdowskyi 
Institute for Health Science Research 
Germans Trias i Pujol, HUGTP ICS, Spain 


7. Jordi Codony-Servat 
Hospital Universitario Quiron Dexeus, 
BMA, Spain 


8. Jillian Wilhelmina Paulina Bracht 
Hospital Universitario Quiron Dexeus, 
BMA, Spain 

9. Imane Chaib 


Institute for Health Science Research 
Germans Trias i Pujol, HUGTP ICS, Spain 


10. Maria Gonzalez Cao 
Hospital Universitario Quiron Dexeus, 
BMA, Spain 


Niki Karachaliou searches for the mechanism 
that allows tumours to resist treatment. 
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